Publications
For a complete list of publications, see also Google Scholar.
Preprint / Submitted
-
Hou, W., and Ji, Z., 2024. A systematic evaluation of large language models for generating programming code. Preprint in arXiv, 2024 March 1. In Journal Review.
-
Hou, W., and Ji, Z., 2024. GPT-4V exhibits human-like performance in biomedical image classification. Preprint in bioRxiv, 2024 January 1. In Journal Review.
-
Hou, W. and Ji, Z., 2023 GeneTuring tests GPT models in genomics. Preprint in bioRxiv, 2023 March 13. In Journal Review.
-
Jackson, C., Cherry, C., Bom, S., Dykema, A., Thompson, E., Zheng, M., Ji, Z., Hou, W., Li, R., Zhang, H. and Choi, J., Rodriguez, F., Weingart, J., Yegnasubramanian, S., Lim, M., Bettegowda, C., Powell, J., Eliesseff, J., Ji, H., and Pardoll, D., 2023. Distinct myeloid derived suppressor cell populations promote tumor aggression in glioblastoma. Preprint in bioRxiv, 2023 Jan 1. In Journal Review.
-
Hou, W. and Ji, Z., 2022. Decomposing spatial heterogeneity of cell trajectories with Paella. Preprint in bioRxiv. Software package: Paella. In Journal Review.
Published / Accepted
Single-cell genomics
- Hou, W.* and Ji, Z.*, Assessing GPT-4 for cell type annotation in single-cell RNA-seq analysis. Nature Methods, 2024 March 25. Software package: GPTCelltype.
- Note 1: Featured in Columbia News Spotlight, Columbia MSPH News, Science Daily, The Medical News, Health Tech World, and 6 other news outlets. It ranked the top #1 when comparing to 69 others from the same source and published within six weeks.
- Note 2: Reviewed in Nature Methods Embedding AI in biology and Toward learning a foundational representation of cells and genes.
- Note 3: As of May/June 2024 , this highly cited paper received enough citations to place it in the top 1% of the academic field of Biology & Biochemistry based on a highly cited threshold for the field and publication year. With the Altmetric Attention Score 284, it ranked the top #1 when comparing to 75 others from the same source and published within six weeks..
-
Hou, W., Ji, Z., Chen, Z., Wherry, E.J., Hicks, S.*, and Ji, H.* A statistical framework for differential pseudotime analysis with multiple single-cell RNA-seq samples. Nature Communications 14, 7286 (2023). Software package: Lamian.
-
Wang, Y., Wang, W., Liu, D., Hou, W., Zhou, T.*, Ji, Z.* GeneSegNet: a deep learning framework for cell segmentation by integrating gene expression and imaging. Genome Biology 24, 235 (2023). Software package: GeneSegNet
-
Dykema, A.G., Zhang, J., Cheung, L.S., Connor, S., Zhang, B., Zeng, Z., Cherry, C.M., Li, T., Caushi, J.X., Nishimoto, M., Munoz, A.J., Ji, Z., Hou, W., Zhan, W., Singh, D., Zhang, T., Rashid, R., Mitchell-Flack, M., Bom, S., Tam, A., Ionta, N., Aye, T.H.K., Wang, Y., Sawosik, C.A., Tirado, L.E., Tomasovic, L.M., Spangler, J.B., Anagnostou, W., Yang, S., Spicer, J., Rayes, R., Taube, J., Brahmer, J.R., Forde, P.M., Yegnasubramanian, S.*, Ji, H.*, Pardoll, M.*, and Smith K.N.*(2023). Lung tumor–infiltrating Treg have divergent transcriptional profiles and function linked to checkpoint blockade response. Science Immunology, 8(87). PMID: 37713507.
-
Hou, W., Ji, Z.* (2022). Palo: spatially-aware color palette optimization for single-cell and spatial data. Bioinformatics, June 01, 2022. Software package: Palo. PMID: 35642896. PMCID: PMC9272793.
-
Hou, W., Ji, Z.* (2022). Single-cell Unbiased Visualization with SCUBI. Cell Reports Methods, 100135, 2022. Software package: scubi. PMID: 35224531. PMCID: PMC8871596
-
Caushi, J.X., Zhang, J., Ji, Z., Vaghasia, A., Zhang, B., Hsiue, E., Mog, B., Hou, W., Justesen, S., Blosser, R., Tam, A., Anagnostou, V., Cottrell, T.R., Guo, H., Chan, H., Singh, D., Thapa, S., Dykema, A., Choudhury, C., Aparicio, L., Cheung, L., Lanis, M., Belcaid, Z., Asmar, M.E., Illei, P., Brock, M., Ha, J., Bush, E., Park, B., Bott, M., Naidoo, J., Marrone, K.A., Reuss, J.E., Velculescu, V.E., Chaft, J.E., Kinzler, K.W., Zhou, S., Vogelstein, B., Taube, J.M., Merghoub, T., Brahmer, J.R., Hellmann, M.D., Forde, P.M., Yegnasubramanian, S.*, Ji, H.*, Pardoll, D.M.*, Smith, K.N.* (2021). Transcriptional programs of neoantigen-specific TIL in anti-PD-1-treated lung cancers. Nature, July 21, 2021. PMID: 34290408 PMCID: PMC8338555.
-
Hou, W., Ji, Z., Ji, H.* and Hicks, S.C.*, (2020). A Systematic Evaluation of Single-cell RNA-sequencing Imputation Methods. Genome Biology 21, 218 (2020), doi: 10.1186/s13059-020-02132-x. PMID: 32854757. PMCID: PMC7450705. Links to: Code, Twitter.
- Ji, Z., Zhou, W., Hou, W. and Ji, H.*, (2020). SCATE: Single-cell ATAC-seq Signal Extraction and Enhancement. Genome Biology, 21,161 (2020). doi: 10.1186/s13059-020-02075-3. PMID: 32620137. PMCID: PMC7333383. Links to: Software package: SCATE, SCATEData.
Boolean networks
-
Hou, W., Ruan, P., Ching, W.K. and Akutsu, T.*, (2019). On the number of driver nodes for controlling a Boolean network when the targets are restricted to attractors. Journal of Theoretical Biology, 463, pp.1-11. doi:10.1016/j.jtbi.2018.12.012.
-
Hou, W., Tamura, T., Ching, W.K. and Akutsu, T.*, (2016). Finding and analyzing the minimum set of driver nodes in control of Boolean networks. Advances in Complex Systems, 19(03), p.1650006. doi: 10.1142/S0219525916500065.
Glycosylation networks
- Hou, W., Qiu, Y., Hashimoto, N., Ching, W.K. and Aoki-Kinoshita, K.F.*, (2016). A systematic framework to derive N-glycan biosynthesis process and the automated construction of glycosylation networks. BMC Bioinformatics, 17(7), p.240. doi:10.1186/s12859-016-1094-6.
Epigenetics
- Xu, R., Hong, X.*, Zhang, B., Huang, W., Hou, W., Wang, G., Wang, X., Igusa, T., Liang, L., Ji, H. (2021) DNA methylation mediates the effect of maternal smoking on offspring birthweight. Clinical Epigenetics, 13:47. doi: 10.1186/s13148-021-01032-6.
Machine learning
-
Jiang, H., Qiu, Y., Hou, W., Cheng, X., Yim, M. and Ching, W.K.*, (2018). Drug side-effect profiles prediction: from empirical risk minimization to structural risk minimization. IEEE/ACM Transactions on Computational Biology and Bioinformatics. doi:10.1109/TCBB.2018.2850884.
-
Jiang, H.*, Ching, W.K., Cheung, W.S., Hou, W. and Yin, H., (2017). Hadamard kernel SVM with applications for breast cancer outcome predictions. BMC Systems Biology, 11(7), p.138. doi:10.1186/s12918-017-0514-1.
-
Jiang, H.*, Ching, W.K. and Hou, W., (2016). On orthogonal feature extraction model with applications in medical prognosis. Applied Mathematical Modelling, 40(19-20), pp.8766-8776. doi:10.1016/j.apm.2016.05.011.
-
Hou, W.*, Chen,Y. and Zhang, Y., (2012) Investigation of Heavy Metal Pollution on Urban Topsoil. Economic Life Digest (in Chinese), 15, pp.204-206. [ISSN1009 – 5535]
Obesity and maternal health
-
Hou, W.*, Zhang, M., Ji, Y., Hong, X., Wang, G., Xu, R., Liang, L., Saria, S. and Ji, H. (2022) A prospective birth cohort study of maternal prenatal cigarette smoking assessed by self-report and biomarkers on childhood risk of overweight or obesity. Precision Nutrition, 1(3), e00017, doi: 10.1097/PN9.0000000000000017.
-
Huang, W., Igusa, T., Wang, G., Buckley, J.P., Hong, X., Bind, E., Steffens, A., Mukherjee, J., Haltmeier, D., Ji, Y., Xu, R., Hou, W., Fan, Z., and Wang, X.* (2022) In-utero co-exposure to toxic metals and micronutrients on childhood risk of overweight or obesity: new insight on micronutrients counteracting toxic metals. International Journal of Obesity, 46, 1435–1445. PMID: 35589962. PMCID: PMC9329205.
Psychiatry
- Ji, Y., Azuine, R.E., Zhang, Y., Hou, W., Hong, X., Wang, G., Riley, A., Pearson, C., Zuckerman, B. and Wang, X.*, (2019). Association of cord plasma biomarkers of in utero acetaminophen exposure with risk of attention-deficit/hyperactivity disorder and autism spectrum disorder in childhood. JAMA Psychiatry, pp.1-11. doi: 10.1001/jamapsychiatry.2019.3259. Featured in NIH news, Reuters health, MedPage Today, meaww, LinksMedicus, TechnologyNetworks.
Posters
-
A computational framework for differential pseudotime analysis across conditions with multiple single-cell RNA-seq samples reveals T cell immune dynamics associated with COVID-19 disease severity, CSHL Systems Immunology 2021, Virtual, Apr 20 - 23, 2021.
-
A systematic evaluation of single-cell RNA-seq imputation methods, 13th Annual Symposium and Poster Session on Genomics and Bioinformatics, Johns Hopkins University, Baltimore, USA, Oct 17, 2019. (first-place winning poster)
-
Causal gene regulatory network construction using single-cell RNA-seq and single-cell ATAC-seq data, 11th annual RECOMB/ISCB Conference on Regulatory & Systems Genomics with DREAM Challenges, New York University, New York, USA, Dec 8-10, 2018.
-
On orthogonal feature extraction model with applications in cancer prediction, University of Cadiz, Cadiz, Spain, May 18, 2016.