Professor Mark Lee BA (Hons), MSc, PhD

School of Computer Science
Professor of Artificial Intelligence

Contact details

Address
School of Computer Science
University of Birmingham
Edgbaston
Birmingham
B15 2TT
UK

Professor Mark Lee is a professor of artificial intelligence in the School of Computer Science. His research interests are focussed on Natural Language Processing. He is specifically interested in Sentiment Analysis of text, the automatic identification and understanding of metaphor, and the effects of pragmatic inference in dialogue processing. More recently, he has been investigating the extraction of constraints from text to build formal models for reasoning. His research has been funded by the Home Office, RCUK, the European Union and various industry partners.

For more information, please see Mark's personal homepage.

Biography

Mark graduated from Sussex University with a BA (Hons) in Computing and Artificial Intelligence, then completed an MSc in System Design at the University of Manchester and a PhD in Natural Language Processing at the University of Sheffield. He joined the University of Birmingham as a Research Fellow in 1998 and became a lecturer in 2000.

Postgraduate supervision

  • Natural Language Processing

Research

Professor Lee's research interests are focussed on the computational processing of natural language text. He has a specific interest in:

  • Sentiment Analysis
  • Semantics/Pragmatics of natural language, especially figurative language
  • Medical Informatics involving Natural Language Processing

NLP, like many other areas of AI, has been transformed by the application of deep neural models and their use to capture rich semantic information. His current interests are 1) understanding theoretically what kinds of linguistic information these models can capture, and 2) developing practical applications using them, notably in healthcare and psychology.

Publications

Recent publications

Article

Abbas, A, Lee, M, Shanavas, N & Kovatchev, V 2024, 'Clinical concept annotation with contextual word embedding in active transfer learning environment', Digital Health.

Gokhan, T, Price, MJ & Lee, M 2024, 'Graphs in clusters: a hybrid approach to unsupervised extractive long document summarization using language models', Artificial Intelligence Review, vol. 57, no. 7, 189. https://doi.org/10.1007/s10462-024-10828-w

Baqir, A, Ali, M, Jaffar, S, Sherazi, HHR, Lee, M, Bashir, AK & Al Dabel, MM 2024, 'Identifying COVID-19 survivors living with post-traumatic stress disorder through machine learning on Twitter', Scientific Reports, vol. 14, no. 1, 18902. https://doi.org/10.1038/s41598-024-69687-8

Chen, S, Quinton, M, Alharbi, A, Bao, H, Bell, B, Carter, B, Duignan, M, Heyes, A, Kaplanidou, K, Karamani, M, Kennelly, J, Kokolakakis, T, Lee, M, Liang, X, Maharaj, B, Mair, J, Smith, A, van Blerk, L & Veldhuijzen van Zanten, J 2024, 'Propositions and recommendations for enhancing the legacies of major sporting events for disadvantaged communities and individuals', Event Management. https://doi.org/10.3727/152599524X17077053867647

Chapter (peer-reviewed)

Laureano De Leon, FA, Tayyar Madabushi, H & Lee, M 2024, Code-Mixed Probes Show How Pre-Trained Models Generalise on Code-Switched Text. in N Calzolari, M-Y Kan, V Hoste, A Lenci, S Sakti & N Xue (eds), Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). International conference on computational linguistics, LREC proceedings, European Language Resources Association (ELRA), pp. 3457–3468, 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, Torino, Italy, 20/05/24. <https://aclanthology.org/2024.lrec-main.307>

Conference contribution

Al Amer, S, Lee, M & Smith, P 2024, Adopting Ensemble Learning for Cross-lingual Classification of Crisis-related Text On Social Media. in Proceedings of The Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2024). Seventh Workshop on Technologies for Machine Translation of Low-Resource Languages, Bangkok, Thailand, 15/08/24.

Li, W, Li, L, Lee, M & Sun, S 2024, ALS: Adaptive Layer Sparsity for Large Language Models via Activation Correlation Assessment. in Advances in Neural Information Processing Systems 37 (NeurIPS 2024). Advances in neural information processing systems, NeurIPS, Thirty-Eighth Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, 10/12/24.

Gamboa, LC & Lee, M 2024, A Novel Interpretability Metric for Explaining Bias in Language Models: Applications on Multilingual Models from Southeast Asia. in 38th Pacific Asia Conference on Language, Information and Computation. Proceedings of the Pacific Asia Conference on Language, Information and Computation, Association for Computational Linguistics, ACL, Tokyo, Japan, 38th Pacific Asia Conference on Language, Information and Computation, Tokyo, Japan, 7/12/24.

Yang, L, Zhou, S, Cheng, J, Zhang, F, Wan, J, Wang, S & Lee, M 2024, DAEA: Enhancing Entity Alignment in Real-World Knowledge Graphs Through Multi-Source Domain Adaptation. in The 31st International Conference on Computational Linguistics.

Abbas, A, Lee, M, Kovatchev, V & Shanavas, N 2025, MTNER: Multiple Tender Named Entities Recognition and Classification from unstructured tender documents. in Proceedings of the 19th International Conference on Ubiquitous Information Management and Communication. IEEE Press / Wiley, International Conference on Ubiquitous Information Management and Communication, Bangkok, Thailand, 3/01/25.

Abbas, A, Lee, M, Shanavas, N, Kovatchev, V & Ali, M 2024, Structured Tender Entities Extraction from Complex Tables with Few-short Learning. in COLING 2025 Proceedings of the Workshop on Regulatory Natural Language Processing (REGNLP 2025). Association for Computational Linguistics, ACL, Regulatory Natural Language Processing Workshop (RegNLP) 2025, Abu Dhabi, United Arab Emirates, 20/01/25.

Other contribution

Chen, S, Liang, X, Quinton, M, Veldhuijzen van Zanten, J & Lee, M 2024, Major Sporting Event Engagement Toolkit for Community-based Organisations. University of Birmingham.

Preprint

Leon, FALD, Madabushi, HT & Lee, M 2024 'Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text' arXiv, pp. 1-13. https://doi.org/10.48550/arXiv.2403.04872

Gamboa, LCL & Lee, M 2024 'Filipino Benchmarks for Measuring Sexist and Homophobic Bias in Multilingual Language Models from Southeast Asia'.

Review article

Liang, X, Quinton, M, Veldhuijzen van Zanten, J, Duan, Z, Carter, B, Heyes, A, Lee, M, Alharbi, A & Chen, S 2024, 'Legacies and impacts of major sporting events for communities and individuals from disadvantaged backgrounds: A systematic review', Equality, Diversity and Inclusion. https://doi.org/10.1108/EDI-02-2024-0058

Expertise

  • Artificial Intelligence
  • Natural Language Processing

Mark has previously provided commentary for the following publications:

  • New Scientist
  • Daily Mail
  • The Metro
  • Daily Express