Hi! 👋
I'm Arpan.

Grad Student at Stanford CS

Building socially intelligent AI systems and improving multimodal LLM systems

Arpan at UIUC

Research Poster @ KDD

Stanford Campus

Stanford Campus

Meta Office

Meta HQ

About Me

I'm a Computer Science MS student at Stanford University and Research Assistant at Stanford AI Lab. I am passionate about building socially intelligent AI systems and improving human-AI interaction.

Previously, I was a Software Engineer (IC4) at Meta working on AI-driven groups and recommendation systems. I completed my BS in Computer Engineering from UIUC.

Research Interests

My research interests lie at the intersection of natural language processing (NLP), human-AI interaction, and graph neural networks (GNNs). I focus on developing socially intelligent agents, detecting corpus-level inconsistencies in LLMs, and enhancing transformers with relational embeddings.

Recent Updates

  • Starting MS in Computer Science at Stanford University!

  • Joined Stanford AI Lab (SALT, OVAL, SNAP) as Research Assistant.

  • Published paper at KDD '23 on IGB dataset.

  • Promoted to IC4 at Meta working on AI-Driven Groups.

  • Reviewed paper for a conference and submitted papers! @ICWSM '23 and @KDD '23.

  • Completed BS in Computer Engineering from UIUC.

  • Received highest intern rating 'Greatly Exceeds Expectations' at Meta.

  • Started internship at Meta in New York.

Experience

Stanford University

Research Assistant

Stanford University • Stanford, CA

  • Developing socially intelligent agent systems for non-competitive non-collaborative goals under Prof. Diyi Yang
  • Building LLM systems for detecting corpus-level inconsistencies with Prof. Monica Lam
  • Enhancing transformers with relational positional embeddings for relational databases with Prof. Jure Leskovec
Meta Inc.

Software Engineer IC4

Meta Inc. • Menlo Park, CA

  • Built new onboarding flows for admins and members leading to increased daily active group creations by 3%
  • Streamlined privacy changes and built framework to promote new admins to admin-less groups
  • Worked with GenAI in FB Groups to create post summaries and answer agents
Meta Inc.

Software Engineering Intern

Meta Inc. • New York, NY

  • Created infinite scroll comments, polls, and overview tab resulting in stat-sig increase in FB watch time
  • Simplified E2E testing framework and documentation for internal languages used by 50+ teams
  • Received highest intern rating of 'greatly exceeds expectation (GE)'

Education

Stanford University

Stanford University

MS in Computer ScienceTranscript•Stanford, CA•2023 - Present•GPA: Current

Achievements

  • •Research Assistant at Stanford AI Lab (SALT, OVAL, SNAP)
  • •Focus on AI Systems and Human-AI Interaction

Research

  • •Working on large language models and human-AI interaction
  • •Developing AI systems for improved user experience
  • •Collaborating with SALT, OVAL, and SNAP research groups

Notable Coursework

  • •CS 224N: Natural Language Processing
  • •CS 224U: Natural Language Understanding
  • •CS 234: Reinforcement Learning
  • •CS 330: Deep Multi-task and Meta Learning
University of Illinois at Urbana-Champaign

University of Illinois at Urbana-Champaign

BS in Computer EngineeringView ThesisTranscript•Urbana, IL•2019 - 2022•GPA: 4.00/4.00

Achievements

  • •James Scholar Honors
  • •Dean's List (All Semesters)

Notable Coursework

  • •ECE 391: Computer Systems Engineering
  • •ECE 411: Computer Organization & Design
  • •ECE 374: Algorithms & Models of Computation
  • •ECE 385: Digital Systems Laboratory

Teaching Experience

  • •Course Assistant for ECE 391: Computer Systems Engineering
  • •Course Assistant for ECE 313: Probability with Engineering Applications
  • •Course Assistant for ECE 210: Analog Signal Processing
  • •Honors Lab Instructor for ECE 110H/120H: Intro to Electronics and Computing

Honors & Awards

  • •Engineering Open House Best Project Award
  • •Knights of St. Patrick Honor
  • •Outstanding Academic Achievement Award

Publications

IGB: An Immense Graph Dataset for Machine Learning Workloads

IGB: An Immense Graph Dataset for Machine Learning Workloads

Arpandeep Khatua, Vikram Sharma Mailthody, Bhagyashree Taleka, Xiang Song, Tengfei Ma, Piotr Bigaj, Wen-mei Hwu

Submitted at KDD 2023

Largest public graph dataset for testing GNN models at scale and sytem optimization.

Detection, Categorization, and Comparison of Needs Expressed on Twitter during Crises

Detection, Categorization, and Comparison of Needs Expressed on Twitter during Crises

Pingjing Yang, Ly Dinh, Hamiz Anjum, Alex Stratton, Arpandeep Khatua, Jana Diesner, Richard Sowers

Submitted at ICWSM 2023

In this study, we use Twitter data to automatically identify who needs what and how types of needs, that we categorized and standardized, have evolved throughout the Ukraine-Russia conflict.

Generating High-Level Article Structure Based on Topic Using Two-stage Seq2seq Model

Generating High-Level Article Structure Based on Topic Using Two-stage Seq2seq Model

Arpandeep Khatua, Adit Agarwal, Kevin CC Chang

In Prep for ACL 2023

Multi-stage subtopic generator for long text generation and richer search results.

Hackathon Projects

CourseLoop

CourseLoop

Devpost
Winner at HackIllinois

Auto-grading on text extracted from PDF assignments with an OCR pipeline, using NLP in python with 98%+ accuracy in 1 week. Reduced auto-grading time by 50% utilizing better algorithms and libraries.

OCRNLPPythonMachine Learning
Kaizen Journal
Winner at HackDuke

Built a custom NLP model to classify text based on mental health conditions and a web page for easier access by patients and health-care professionals with an OCR and voice to text functionality.

NLPOCRVoice-to-TextHealthcare
Mauka - Job Search Portal
Presented at Hex Cambridge

Job search portal by scraping real time information from Google and LinkedIn to help curb increasing unemployment rates due to COVID-19 in developing countries.

Web ScrapingReactNode.jsJob Search

Recreational Interests

Painting

Oil on canvas - my latest work

Debating

President @ Toastmasters International Gavel Club

Hiking

Looking forward to recreate the macOS wallpaper pictures

Cooking

Trying out new recipes every weekend!

Entertainment

"If I can't scuba what's this all about?"

Get in Touch

I'm always open to discussing research collaborations, AI/ML projects, or opportunities in tech. Feel free to reach out!