About Me
I am a Postdoctoral Researcher at King’s College London working with Dr Albert Meroño Peñuela and Professor Elena Simperl at the Department of Informatics. With a PhD in Knowledge Graphs, my current research focuses on multimodal content generation and generative models grounded in structured knowledge. I am also developing neuro-symbolic methods for interpretable AI and actively contributing to the responsible AI features in standards like MLCommons Croissant.
Recent News
- (May 2025) Our paper on An Annotation Protocol for Diachronic Evaluation of Semantic Drift in Disability Sources has been accepted at the Linguistic Annotation Workshop (LAW) 2025, which will be co-located with ACL 2025 in Vienna this year.
- (Mar 2025) Our research manuscript proposing a neurosymbolic approach for generating interpretable embeddings has been accepted for publication in the Neurosymbolic AI Journal! (preprint here).
- (Dec 2024) Excited to share our paper on the Croissant metadata standard will be presented as a spotlight paper at NeurIPS 2024 (top ~3% of submissions).
- (Nov 2024) Attended the 10th Meet-up of The Turing Interest Group on Knowledge Graphs in at the Edinburgh Future Institute, University of Edinburgh and gave a presentation titled ‘Towards Interpretable Embeddings: Aligning Representations with Semantic Aspects’.
- (Sept 2024) Excited to have been invited to present our work on the Croissant metadata standard and the Responsonsible AI vocabulary at the Sony AI Journal Club with a talk titled ‘Croissant-RAI: Standardized Machine-readable Dataset Documentation Format for Responsible AI’ (slides here).
Recent Publications
- Nitisha Jain, Antoine Domingues, Adwait Baokar, Albert Meroño Peñuela, Elena Simperl : Towards Interpretable Embeddings: Aligning Representations with Semantic Aspects. Neurosymbolic AI Journal 2025. [Paper]
- Nitisha Jain, Chiara Di Bonaventura, Albert Meroño-Peñuela, Barbara McGillivray : An Annotation Protocol for Diachronic Evaluation of Semantic Drift in Disability Sources. Linguistic Annotation Workshop (LAW) 2025, Association for Computational Linguistics (ACL). [Paper]
- Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, Luca Foschini, Joan Giner-Miguelez, Pieter Gijsbers, Sujata Goswami, Nitisha Jain et al. : Croissant: A Metadata Format for ML-Ready Datasets. Neural Information Processing Systems (NeurIPS) 2024. [Paper] (Spotlight paper)
- Towards deployment-centric multimodal AI beyond vision and language. arXiv preprint 2025 arXiv:2504.03603
- Elisavet Koutsiana, Ioannis Reklos, Kholoud Saad Alghamdi, Nitisha Jain, Albert Meroño-Peñuela, Elena Simperl : Talking Wikidata: Communication Patterns and Their Impact on Community Engagement in Collaborative Knowledge Graphs. Transactions on Graph Data & Knowledge, 2025. [Paper]
- Elisavet Koutsiana, Tushita Yadav, Nitisha Jain, Albert Merono Penuela, Elena Simperl : Agreeing and disagreeing in collaborative knowledge graph construction: An analysis of Wikidata. Journal of Web Semantics, 2025. [Paper]
- Nitisha Jain, Mubashara Akhtar, Joan Giner-Miguelez, Rajat Shinde, Omar Benjelloun, Elena Simperl et al. : A Standardized Machine-readable Dataset Documentation Format for Responsible AI. arXiv preprint 2024 arXiv:2407.16883
- Mubashara Akhtar, Omar Benjelloun, Costanza Conforti, Pieter Gijsbers, Joan Giner-Miguelez, Nitisha Jain et al.: Croissant: A Metadata Format for ML-Ready Datasets. Proceedings of the Eighth Workshop on Data Management for End-to-End Machine Learning (DEEM), 2024. [Paper]
- Antoine Domingues, Nitisha Jain, Albert Meroño Peñuela, Elena Simperl: Bringing Back Semantics to Knowledge Graph Embeddings : An Interpretability Approach. Proceedings of the 18th International Conference on Neural-Symbolic Learning and Reasoning (NeSy), 2024. [Paper]
- Jacopo de Berardinis, Valentina Anita Carriero, Nitisha Jain et al.: The Polifonia Ontology Network: Building a Semantic Backbone for Musical Heritage. Proceedings of the 22nd International Semantic Web Conference (ISWC), 2023. [Paper]
- Bohui Zhang, Ioannis Reklos, Nitisha Jain, Albert Meroño Peñuela, Elena Simperl: Using Large Language Models for Knowledge Engineering (LLMKE): A Case Study on Wikidata. Joint proceedings of the KBC-LM workshop and the LM-KBC challenge @ ISWC 2023. [Paper]
Miscellaneous
Service
-
Organization of the Workshop on Generative Neuro-Symbolic AI (GeNeSy), co-located with ESWC 2024.
-
Sponsorship Chair of the ESWC 2024 conference along with Jan-Christoph Kalo.
-
Paper reviewing
- 2025 : Semantic Web Journal, ISWC, Semantics, SemDH (ESWC)
- 2024 : Semantics, AI4DH (ESWC)
- 2023 : TKDD, Semantic Web Journal
- 2022 : Semantic Web Journal, WebConference, ESWC, CIKM, LWDA,
- Previously : EMNLP(2021) LWDA(2020), AAAI(2019).
- PC member for AI4DH 2021 (held in conjunction with ICIAP 2021), SUKI 2022 (in conjunction with NAACL 2022).
- Reviewed papers for Semantic Web Journal(2022), WebConference(2022), ESWC(2022), CIKM(2022), LWDA(2022), EMNLP(2021), LWDA(2020), AAAI(2019).
- PC member for AI4DH 2021 (held in conjunction with ICIAP 2021), SUKI 2022 (in conjunction with NAACL 2022).
Teaching and Supervision
- January 2025 - Teaching Assistant for Network Data Analysis (Masters lecture)
- January 2025 - Teaching Assistant for Knowledge Engineering (Bachelors lecture)
- January 2024 - Teaching Assistant for Network Data Analysis (Masters lecture)
- Summer 2024 - Bringing Back Semantics to Knowledge Graph Embeddings : An Interpretability Approach (internship), Student : Antoine Domingues, MSc, ENSTA Paris
- Summer 2023 - Evaluation of the understanding and knowledge of Large Language Models compared to Knowledge Graphs (internship), Albin Joyeux, MSc, ENSTA PAris
- Summer 2023 - Collaborative Use Of Generative AI (internship), Student : Shantanu Suwarnkar