Research

I work at the intersection of privacy-preserving NLP, clinical mental health AI, and dialogue systems. My PhD focuses on generating high-fidelity synthetic therapy sessions that preserve patient privacy. I am also broadly interested in multimodality and representation learning.

Publications

Preprint 2025

Graph2Counsel: Clinically Grounded Synthetic Counseling Dialogue Generation from Client Psychological Graphs

Aishik Mandal, Hiba Arnaout, Clarissa W. Ong, Juliet Bockhorst, Kate Sheehan, Rachael Moldow, Tanmoy Chakraborty, Iryna Gurevych

A framework for generating clinically grounded synthetic counseling dialogues from structured client psychological graphs, enabling privacy-preserving training data for mental health AI.

Paper Code Data & Model

Frontiers in Digital Health 2026

SPEAK-SAFE: Secure Processing of Electronic Audio for Knowledge in Suicide Assessment from Therapeutic Exchanges

Christopher Landau, Patricia Getty, Caroline Gruler, Rebekka Stadje, Sofia Arampatzi, Aishik Mandal, Anmol Goel, Iryna Gurevych, Andreas Reif, Oliver Grimm

Secure audio processing pipeline for suicide risk assessment from therapeutic exchanges, combining privacy-preserving techniques with clinical NLP.

Paper

Preprint 2025

MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions

Aishik Mandal, Tanmoy Chakraborty, Iryna Gurevych

A multi-agent LLM framework for generating synthetic therapy sessions, achieving 3.8% improvement on standard clinical scales and 77.2% expert preference over state-of-the-art datasets.

Paper Code Data & Model

Preprint 2025

A Comprehensive Survey of Datasets for Clinical Mental Health AI Systems

Aishik Mandal, Prottay Kumar Adhikary, Hiba Arnaout, Iryna Gurevych, Tanmoy Chakraborty

A structured survey of datasets used in clinical mental health AI, covering data collection methodologies, annotation practices, privacy considerations, and benchmark tasks.

Paper

Privacy-aware Mental Health AI thumbnail

Nature Computational Science 2025

Towards Privacy-aware Mental Health AI Models: Advances, Challenges, and Opportunities

Aishik Mandal, Tanmoy Chakraborty, Iryna Gurevych

Identifies current privacy issues and threats in mental health AI models and datasets. Reviews solutions from literature, evaluation methods, and proposes a pipeline for creating privacy-aware AI models in the mental health domain.

Paper

Findings EMNLP 2025

CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation

Emilio Villa-Cueva, Sholpan Bolatzhanova, Diana Turmakhan, Kareem Elzeky, …, Aishik Mandal, …, Thamar Solorio

A benchmark for evaluating cultural awareness in multimodal machine translation, covering multiple languages and cultural contexts.

Paper

Oral · CLPsych 2025

Enhancing Depression Detection via Question-wise Modality Fusion

Aishik Mandal, Dana Atzil-Slonim, Thamar Solorio, Iryna Gurevych

A novel multimodal depression severity prediction framework with a question-aware fusion mechanism and a loss function designed for imbalanced ordinal classification.

Paper Code

Oral · NeurIPS 2024

CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

David Orlando Romero Mogrovejo, Chenyang Lyu, Haryo Akbarianto Wibowo, Santiago Gongora, Aishik Mandal, …, Thamar Solorio, Alham Fikri Aji

A multicultural and multilingual visual QA dataset spanning diverse cultural contexts, with comprehensive benchmarking of current large vision-language models.

Paper Dataset

TMLR 2023

A Revenue Function for Comparison-Based Hierarchical Clustering

Aishik Mandal, Michaël Perrot, Debarghya Ghoshdastidar

Introduced the first framework to evaluate hierarchical clustering using comparisons. A novel revenue function avoids reliance on pairwise similarities or ground-truth trees, achieving rank correlations of +0.9 with standard quality metrics.

Paper Code

Discourse Mutual Information model thumbnail

NAACL 2022

Representation Learning for Conversational Data using Discourse Mutual Information Maximization

Bishal Santra, Sumegh Roychowdhury, Aishik Mandal, Vasu Gurram, Atharva Naik, Manish Gupta, Pawan Goyal

Pretrained dialogue encoder using a structure-aware Discourse Mutual Information (DMI) loss function, achieving state-of-the-art results on dialogue understanding and retrieval tasks.

Paper Project Page

CIKM 2021

Knowledge-Aware Neural Networks for Medical Forum Question Classification

Soumyadeep Roy, Sudip Chakraborty, Aishik Mandal, Gunjan Balde, Prakhar Sharma, Anandhavelu Natarajan, Megha Khosla, Shamik Sural, Niloy Ganguly

A knowledge-aware BERT model for classifying questions in medical forums, improving accuracy on clinical text classification by incorporating external medical knowledge.

Paper Code

Thesis & Research Projects

Master's Thesis 2023

Universal Transformer for Multimodal Self-supervised Learning

Aishik Mandal, Jiaul Hoque Paik · IIT Kharagpur, Centre of Excellence in AI

Proposed a novel crossmodal contrastive loss to train a 12-layer transformer for joint image–text embeddings on COCO, achieving competitive performance with Sentence Transformers on text-only tasks.

Report

Turn-taking in conversational agents thumbnail

Internship Project 2022

Multimodal Turn Taking in Conversational Agents

Aishik Mandal, Biswesh Mahapatra, Justine Cassell · INRIA Paris, Articulabo

Developed a multimodal turn-taking predictor for conversational agents using LSTMs, attention mechanisms, and a pre-trained transformer to jointly model audio, visual, and text signals. Achieved macro F1 improvement of +0.09 over state-of-the-art models.

Bachelor's Thesis 2022

Discourse Mutual Information for Dialogue Understanding and Response Generation

Aishik Mandal, Bishal Santra, Pawan Goyal · IIT Kharagpur, Dept. of CSE

Proposed a novel discourse mutual information loss for a transformer-based dual encoder, plus a mapping algorithm from context to response embeddings. Achieved +4.1% on dialogue classification tasks and +10.6% on dialogue evaluation tasks over state-of-the-art baselines.

Report