Publications

Most papers are freely available (PDF links). Just ask me for the others.

Preprints

  • Burk, L., Zobolas, J., Bischl, B., Bender, A., Wright, M. N. & Sonabend, R. (2024). A large-scale neutral comparison study of survival models on low-dimensional data. arXiv. https://arxiv.org/abs/2406.04098. PDF
  • Ewald, F. K., Bothmann, L., Wright, M. N., Bischl, B., Casalicchio, G. & König, G. (2024). A guide to feature importance methods for scientific inference. arXiv (accepted at xAI 2024). https://arxiv.org/abs/2404.12862. PDF
  • Koenen, N. & Wright, M. N. (2024). Toward understanding the disagreement problem in neural network feature attribution. arXiv (accepted at xAI 2024). https://arxiv.org/abs/2404.11330. PDF
  • Dandl, S., Blesch, K., Freiesleben, T., König, G., Kapar, J., Bischl, B. & Wright, M. N. (2024). CountARFactuals – Generating plausible model-agnostic counterfactual explanations with adversarial random forests. arXiv (accepted at xAI 2024). https://arxiv.org/abs/2404.03506. PDF
  • Langbein, S. H., Krzyziński, M., Spytek, M., Baniecki, H., Biecek, P. & Wright, M. N. (2024). Interpretable machine learning for survival analysis. arXiv. https://arxiv.org/abs/2403.10250. PDF
  • Koenen, N. & Wright, M. N. (2023). Interpreting deep neural networks with the package innsight. arXiv (accepted at Journal of Statistical Software). https://arxiv.org/abs/2306.10822. PDF
  • Dijkstra, L., Schink, T., Linder, R., Schwaninger, M., Pigeot, I., Wright, M. N. & Foraita, R. (2022). A discovery and verification approach for pharmacovigilance using electronic health care data. medRxiv. https://www.medrxiv.org/content/10.1101/2022.05.10.22274885v1. PDF

Journal Articles, Conference and Workshop Papers

  • Blesch, K. & Wright, M. N. (2024). arfpy: A python package for density estimation and generative modeling with adversarial random forests. Journal of Open Research Software 12:7. https://doi.org/10.5334/jors.492. PDF
  • Spytek, M., Krzyziński, M., Langbein, S. H., Baniecki, H., Wright, M. N. & Biecek, P. (2023). survex: an R package for explaining machine learning survival models. Bioinformatics 39. https://doi.org/10.1093/bioinformatics/btad723. PDF
  • Molnar, C., Freiesleben, T., König, G., Herbinger, J., Reisinger, T., Casalicchio, G., Wright, M. N. & Bischl, B. (2023). Relating the partial dependence plot and permutation feature importance to the data generating process. World Conference on Explainable Artificial Intelligence (xAI) 2023. https://doi.org/10.1007/978-3-031-44064-9_24. PDF
  • Blesch, K., Wright, M. N. & Watson, D. S. (2023). Unfooling SHAP and SAGE: Knockoff imputation for Shapley values. World Conference on Explainable Artificial Intelligence (xAI) 2023. https://doi.org/10.1007/978-3-031-44064-9_8. PDF
  • Watson, D. S., Blesch, K., Kapar, J. & Wright, M. N. (2023). Adversarial random forests for density estimation and generative modeling. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS) PMLR 206:5357-5375. https://proceedings.mlr.press/v206/watson23a.html. PDF
  • Hiabu, M., Meyer, J. T. & Wright, M. N. (2023). Unifying local and global model explanations by functional decomposition of low dimensional structures. Proceedings of The 26th International Conference on Artificial Intelligence and Statistics (AISTATS) PMLR 206:7040-7060. https://proceedings.mlr.press/v206/hiabu23a.html. PDF
  • Blesch, K., Watson, D. S. & Wright, M. N. (2023). Conditional feature importance for mixed data. AStA Advances in Statistical Analysis. https://doi.org/10.1007/s10182-023-00477-9. PDF
  • Bonannella, C., Hengl, T., Heisig, J., Parente, L., Wright, M. N., Herold, M. & de Bruin, S. (2022). Forest tree species distribution for Europe 2000-2020: mapping potential and realized distributions using spatiotemporal machine learning. PeerJ 10:e13728. https://doi.org/10.7717/peerj.13728. PDF
  • Mehlig, K., Foraita, R., Nagrani, R., Wright, M. N., De Henauw, S., Molnár, D., Moreno, L. A., Russo, P., Tornaritis, M., Veidebaum, T., Lissner, L., Kaprio, J. & Pigeot, I., on behalf of the IDEFICS and I.Family consortia (2023). Genetic associations vary across the spectrum of fasting serum insulin: results from the European IDEFICS/I.Family children’s cohort. Diabetologia 66:1914–1924. https://doi.org/10.1007/s00125-023-05957-w. PDF
  • Baudeu, R., Wright, M. N. & Loecher, M. (2022). Are SHAP values biased towards high-entropy features? ECML PKDD Workshop on eXplainable Knowledge Discovery in Data Mining. https://doi.org/10.1007/978-3-031-23618-1_28. PDF
  • Watson, D. S. & Wright, M. N. (2021). Testing conditional independence in supervised learning algorithms. Machine Learning 110:2107-2129. https://doi.org/10.1007/s10994-021-06030-6. PDF
  • Askland, K. D., Strong, D., Wright, M. N. & Moore, J. H. (2021). The translational machine: A novel machine-learning approach to illuminate complex genetic architectures. Genetic Epidemiology 45:485-536. https://doi.org/10.1002/gepi.22383. PDF
  • Hüls, A.*, Wright, M. N.*, Bogl, L. H., Kaprio, J., Lissner, L., Molnár, D., Moreno, L., De Henauw, S., Siani, A., Veidebaum, T., Ahrens, W., Pigeot, I. & Foraita, R. (2021). Polygenic risk for obesity and its interaction with lifestyle and sociodemographic factors in European children and adolescents. International Journal of Obesity 45:1321-1330. https://doi.org/10.1038/s41366-021-00795-5. PDF *Equal contribution
  • Wright, M. N., Kusumastuti, S., Mortensen, L. H., Westendorp, R. G. J. & Gerds, T. A. (2021). Personalised need of care in an ageing society: The making of a prediction tool based on register data. Journal of the Royal Statistical Society: Series A (Statistics in Society) 184:1199-1219. https://doi.org/10.1111/rssa.12644. PDF
  • Koenen, N., Wright, M. N., Maass, P. & Behrmann, J. (2021). Generalization of the change of variables formula with applications to residual flows. ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models. https://openreview.net/forum?id=msCiI5dejr. PDF
  • Breau, B., Brandes, B., Wright, M. N., Buck, C., Vallis, L. A. & Brandes, M. (2020). Association of individual motor abilities and accelerometer-derived physical activity measures in preschool-aged children. Journal for the Measurement of Physical Behaviour 4:227-235. https://doi.org/10.1123/jmpb.2020-0065. Free Postprint
  • Brandes, B., Buck, C., Wright, M. N., Pischke, C. R. & Brandes, M. (2020). Impact of “JolinchenKids—Fit and healthy in daycare” on children’s objectively measured physical activity: A cluster-controlled study. Journal of Physical Activity and Health 17:1025-1033. https://doi.org/10.1123/jpah.2019-0536. Free Postprint
  • Schmid, M., Welchowski, T., Wright, M. N. & Berger, M. (2020). Discrete-time survival forests with Hellinger distance decision trees. Data Mining and Knowledge Discovery 34:812-832. https://doi.org/10.1007/s10618-020-00682-z. PDF
  • Boulesteix, A-L., Wright, M. N., Hoffmann, S. & König, I. R. (2020). Statistical learning approaches in the genetic epidemiology of complex diseases. Human Genetics 139:73–84. https://doi.org/10.1007/s00439-019-01996-9. Free read-only version
  • Weinhold, L., Schmid, M., Mitchell, R., Maloney, K. O., Wright, M. N. & Berger, M. (2020). A random forest approach for modeling bounded outcome variables. Journal of Computational and Graphical Statistics 29:639-658. https://doi.org/10.1080/10618600.2019.1705310. Free Preprint
  • Hornung, R. & Wright, M. N. (2019). Block Forests: random forests for blocks of clinical and omics covariate data. BMC Bioinformatics 20:358. https://doi.org/10.1186/s12859-019-2942-y. PDF
  • Steenbock, B., Wright, M. N., Wirsik, N. & Brandes, M. (2019). Accelerometry-based prediction of energy expenditure in preschoolers. Journal for the Measurement of Physical Behaviour 2:94-102. https://doi.org/10.1123/jmpb.2018-0032. Free Preprint
  • Wright, M. N. & König, I. R. (2019). Splitting on categorical predictors in random forests. PeerJ 7:e6339. https://doi.org/10.7717/peerj.6339. PDF
  • Probst, P., Wright, M. N. & Boulesteix, A-L. (2019). Hyperparameters and tuning strategies for random forest. WIREs Data Mining and Knowledge Discovery 9:e1301. https://doi.org/10.1002/widm.1301. Free Preprint
  • Hengl, T., Nussbaum, M., Wright, M. N., Heuvelink, G. B. M. & Gräler, B. (2018). Random forest as a generic framework for predictive modeling of spatial and spatio-temporal variables. PeerJ 6:e5518. https://doi.org/10.7717/peerj.5518. PDF
  • Fouodo, C. J. K., König, I. R., Weihs, C., Ziegler, A. & Wright, M. N. (2018). Support vector machines for survival analysis with R. The R Journal 10:412–423. https://doi.org/10.32614/RJ-2018-005. PDF
  • Nembrini, S., König, I. R. & Wright, M. N. (2018). The revival of the Gini Importance? Bioinformatics 34:3711–3718. https://doi.org/10.1093/bioinformatics/bty373. PDF
  • Hirose, M., Schilf, P., Gupta, Y., Zarse, K., Künstner, A., Fähnrich, A., Busch, H., Yin, J., Wright, M. N., Ziegler, A., Vallier, M., Belheouane, M., Baines, J. F., Tautz, D., Johann, K., Oelkrug, R., Mittag, J., Lehnert, H., Othman, A., Jöhren, O., Schwaninger, M., Prehn, C., Adamski, J., Shima, K., Rupp, J., Häsler, R., Fuellen, G., Köhling, R., Ristow, M. & Ibrahim, S. M. (2018). Low-level mitochondrial heteroplasmy modulates DNA replication, glucose metabolism and lifespan in mice. Scientific Reports 8:5872. https://doi.org/10.1038/s41598-018-24290-6. PDF
  • Foraita, R., Dijkstra, L., Falkenberg, F., Garling, M., Linder, R., Pflock, R., Rizkallah, M. R., Schwaninger, M., Wright, M. N. & Pigeot, I. (2018). Detection of drug risks after approval: Methods development for the use of routine statutory health insurance data. Bundesgesundheitsblatt 61:1075–1081. https://doi.org/10.1007/s00103-018-2786-z. PDF
  • Wright, M. N. & Ziegler, A. (2017). ranger: A fast implementation of random forests for high dimensional data in C++ and R. Journal of Statistical Software 77:1–17. https://doi.org/10.18637/jss.v077.i01. PDF
  • Hengl, T., Mendes de Jesus, J., Heuvelink, G. B., Ruipérez Gonzalez, M., Kilibarda, M., Blagotić, A., Shangguan, W., Wright, M. N., et al. (2017). SoilGrids250m: Global gridded soil information based on machine learning. PLOS ONE 12:e0169748. https://doi.org/10.1371/journal.pone.0169748. PDF
  • Wright, M. N., Dankowski, T. & Ziegler, A. (2017). Unbiased split variable selection for random survival forests using maximally selected rank statistics. Statistics in Medicine 36:1272–1284. https://doi.org/10.1002/sim.7212. Free Preprint
  • Hirose, M., Schilf, P., Gupta, Y., Wright, M. N., Jöhren, O., Wagner, A. E., Sina, C., Ziegler, A., Ristow, M. & Ibrahim, S. M. (2016). Lifespan effects of mitochondrial mutations. Nature 540:E13–E14. https://doi.org/10.1038/nature20778.
  • Schmid, M., Wright, M. N. & Ziegler, A. (2016). On the use of Harrell’s C for clinical risk prediction via random survival forests. Expert Systems with Applications 63:450–459. https://doi.org/10.1016/j.eswa.2016.07.018. Free Preprint
  • Schirmer, J. H., Wright, M. N., Herrmann, K., Laudien, M., Nölle, B., Reinhold-Keller, E., Bremer, J. P., Moosig, F. & Holle, J. U. (2016). Myeloperoxidase-ANCA associated Granulomatosis with polyangiitis is a clinically distinct subset within ANCA-associated vasculitis. Arthritis & Rheumatology 68:2953–2963. https://doi.org/10.1002/art.39786. PDF
  • Wright, M. N., Ziegler, A. & König, I. R. (2016). Do little interactions get lost in dark random forests? BMC Bioinformatics 17:145. https://doi.org/10.1186/s12859-016-0995-8. PDF
  • Schirmer, J. H., Wright, M. N., Vonthein, R., Herrmann, K., Nölle, B., Both, M., Henes, F., Arlt, A., Gross, W. L., Schinke, S., Reinhold-Keller, E., Moosig, F. & Holle, J. U. (2016). Clinical presentation and long-term outcome of 144 patients with microscopic polyangiitis in a monocentric German cohort. Rheumatology (Oxford) 55:71–79. https://doi.org/10.1093/rheumatology/kev286. PDF
  • Wright, M. N. & Ziegler, A. (2015). Multiple censored data in dentistry: A new statistical model for analyzing lesion size in randomized controlled trials. Biometrical Journal 57:384–394. https://doi.org/10.1002/bimj.201400118.
  • Paulick, C., Wright, M. N., Verleger, R. & Keller, K. (2014). Decomposition of 3-way arrays: A comparison of different PARAFAC algorithms. Chemometrics and Intelligent Laboratory Systems 137:97–109. https://doi.org/10.1016/j.chemolab.2014.06.009.

Book Chapters

  • Wright, M. N. (2023). Feature Selection. In: Bischl, B., Sonabend, R., Kotthoff, L. & Lang, M. (Eds.) Applied Machine Learning Using mlr3 in R. CRC Press, Boca Raton, Florida. https://mlr3book.mlr-org.com/feature_selection.html. HTML
  • Dandl, S., Biecek, P., Casalicchio, G. & Wright, M. N. (2023). Model Interpretation. In: Bischl, B., Sonabend, R., Kotthoff, L. & Lang, M. (Eds.) Applied Machine Learning Using mlr3 in R. CRC Press, Boca Raton, Florida. https://mlr3book.mlr-org.com/model_interpretation.html. HTML
  • Binder, M., Pfisterer, F., Becker, M. & Wright, M. N. (2023). Non-sequential Pipelines and Tuning. In: Bischl, B., Sonabend, R., Kotthoff, L. & Lang, M. (Eds.) Applied Machine Learning Using mlr3 in R. CRC Press, Boca Raton, Florida. https://mlr3book.mlr-org.com/sequential_pipelines_and_tuning.html. HTML
  • Wright, M. N.*, Gola, D.* & Ziegler, A. (2017). Preprocessing and Quality Control for Whole-Genome Sequences from the Illumina HiSeq X Platform. In: Elston, R. C. (Ed.) Statistical Human Genetics (2nd edn.). Methods in Molecular Biology 1666:629-647. Humana Press, New York. https://doi.org/10.1007/978-1-4939-7274-6_30. HTML *Equal contribution

Editorials

Other

  • Pigeot, I., Fröhlich, H., Intemann, T., Prause, G. & Wright, M. N. (2023). KI und die Nationale Forschungsdateninfrastruktur für personenbezogene Gesundheitsdaten (NFDI4HEALTH). In: Dössel, O., Schäffter, T. & Rutert, B. (Eds.) Künstliche Intelligenz in der Medizin. Berlin-Brandenburgische Akademie der Wissenschaften 11:62-74. ISBN 978-3-949455-18-6. PDF