CV
You can download the CV from pdf icon.
Profile
| Name | Akanksha Singh |
| Title | Researcher |
| akankshasingh.2540@gmail.com | |
| Phone | (+91) 766 732 8057 |
| Website | akankshasingh25.github.io |
| Summary | Researcher in machine learning and deep learning with expertise in vision and speech. My broad goal is advancing responsible AI research, with specific focus on fairness, inclusion, explainability, societal and environmental impact. |
| -singh-akanksha | |
| GitHub | akankshasingh25 |
Work Experience
-
08.2024 - Present Post-Baccalaureate Fellow
Centre for Responsible AI (CeRAI), IIT Madras, India
- Supervisor - Prof. Balaraman Ravindran
- Leading the research on grounding artefacts of audio deepfakes into linguistic and acoustic features, speech and non-speech, phonetics, phonology, morphology, syntax, and semantics to develop an explainable detection model by design, tailored to the Indian context.
- At the initial discussion stage of the project on multi-agent systems for multimodal deepfake detection, in tandem with the RAG framework to provide additional support through fact-checking.
- Chapter author of the 'Grounding Responsible AI in Agriculture', a project in collaboration with Meta OpenLoop. The work explores current applications and associated risks of AI in agriculture, identifies key gaps in its development and deployment, and examines relevant policies, regulations, and legal frameworks. The paper emphasises the importance of responsible AI through actionable technical and policy recommendations, grounded in expert interviews and comprehensive analysis research.
-
05.2023 - 07.2024 External Master's Thesis Student
Image Analysis and Biometrics (IAB) Lab, Department of CSE, IIT Jodhpur, India
- Supervisor - Prof. Mayank Vatsa & Prof. Richa Singh
- Created and curated a large (approx 1.3M samples) and diverse 26 generation techniques across 4 sets) bias-free benchmark multimodal multilingual deepfake dataset comprising identity-aware swaps and synthetic media.
- Performed qualitative experiments such as BRISQUE score for visual quality and FAD for audio quality. And benchmarked existing state-of-the-art unimodal (visual and audio) and multimodal deepfake detection models.
Education
-
08.2019 - 07.2024 Bhopal, India
Bachelor of Science - Master of Science (BS-MS) Dual Degree
Indian Institute of Science Education and Research Bhopal
Major: Electrical Engineering and Computer Science; Minor: Data Science and Engineering
- Data Science and Artificial Intelligence - Machine Learning, Deep Learning, Natural Language Processing, Computer Vision
- Computer Science - Data Structures, Algorithms, Computer Architecture, Operating Systems
- Mathematics - Linear Algebra, Probability and Statistics, Calculus
Publications
-
2025 ILLUSION: Unveiling Truth with a Comprehensive Multi-Modal, Multi-Lingual Deepfake Dataset
The Thirteenth International Conference on Learning Representations (ICLR 2025)
Technical Skills
| Machine Learning & Deep Learning | |
| Python | |
| PyTorch | |
| Hugging Face | |
| Git/GitHub | |
| Bash Scripting | |
| LaTeX | |
| Slurm | |
| Docker/Kubernetes |
Research Interests
| Fairness | |
| Evalaution of bias in models and datasets | |
| Mitigation strategies | |
| Debiasing strategies |
| Model Editing | |
| Unlearning | |
| Membership Inference Attacks | |
| Memorisation |
| Misinformation | |
| Deepfake Detection - Audio, Vision, Text and Multimodal | |
| Dissemination of (mis-mal-dis)information |
| Inclusion | |
| Multilingual & Multicultural AI | |
| Low-resource languages |
Honors & Awards
- 2024
Women Post-Baccalaureate Fellow
Centre for Responsible AI (CeRAI), IIT Madras
- 2023
Qualified for Assistant Professor, Subject - CSA
University Grants Commission - National Eligibility Test (UGC-NET)
Research Projects
- 01.2023 - 03.2023
Interpretable Deep Learning for Text Classification
- Implemented an existing novel interpretability framework for text CNNs from scratch to linguistically motivated features. The proposed features are borrowed from existing work on text classification of shorter texts between fiction and non-fiction genres Brown Corpus).
- The exit criteria are to corroborate our interpretability results with the baseline results obtained from the existing work using logistic regression.
- 08.2022 - 11.2022
Consumer Complaints: Text Classification Problem
- Used machine learning and deep learning models for a multi-class nominal label classification
- A vectorization and feature selection pipeline was used for ML architecture, and hyperparameters were fed to a cross-validation grid search on 5 supervised learning models. Logistic regression gave the best results, accuracy of 85.9%.
- For DL architecture, FFN, bi-directional LSTM, RNN, and transformers were used. Bi-directional LSTM gave the best results, with an accuracy of 86.5%.
Languages Known
| Hindi | |
| Native speaker |
| English | |
| Bilingual proficiency |