The Big Data Intelligence lab at KAIST is led by Prof. Joyce Jiyoung Whang at the School of Computing. The group engages in fundamental research on big data, emphasizing advanced algorithms for modeling and analyzing massive scientific and industrial datasets. In particular, the group focuses on graph machine learning and mining to interpret complex entity interactions and solve diverse real-world challenges with highly scalable algorithms.
Research Areas
연구 영역 상세
연구 영역을 선택하면 자세한 내용이 표시됩니다.
Professor
Members
Ph.D. Students
Chanyoung Chung (정찬영)
Mar. 2022 ~: M.S./Ph.D. Integrated Program in School of Computing
Mar. 2021 ~ Feb. 2022: M.S. Program in School of Computing
Feb. 2021: B.S. in Computing (double major: Mathematical Sciences), KAIST
Jaejun Lee (이재준)
Mar. 2023 ~: Ph.D. Program in School of Computing
Feb. 2023: M.S. in Computing, KAIST
Aug. 2021: B.S. in Computing (double major: Mathematical Sciences), KAIST
Heehyeon Kim (김희현)
Sep. 2024 ~: Ph.D. Program in School of Computing
Aug. 2024: M.S. in Computing, KAIST
Aug. 2022: B.S. in IoT Artificial Intelligence Convergence, Chonnam National University
Minsung Hwang (황민성)
Sep. 2024 ~: Ph.D. Program in School of Computing
Aug. 2024: M.S. in Computing, KAIST
Feb. 2023: B.S. in Electrical Engineering (minor: Computing), KAIST
Jinhyeok Choi (최진혁)
Mar. 2025 ~: Ph.D. Program in School of Computing
Feb. 2025: M.S. in Computing, KAIST
Feb. 2023: B.S. in Computing (minor: Electrical Engineering), KAIST
Kyeongryul Lee (이경률)
Mar. 2026 ~: Ph.D. Program in School of Computing
Feb. 2026: M.S. in Data Science, KAIST
Feb. 2024: B.S. in Data Science (double major: Financial Mathematics & Statistics), The University of Sydney
M.S. Students
Donggyu Yoon (윤동규)
Mar. 2025 ~: Program in School of Computing
Feb. 2025: B.S. in Computing (minor: Electrical Engineering), KAIST
Kidong Nam (남기동)
Mar. 2025 ~: KT-AI M.S. Program in School of Computing
Aug. 2024: B.S. in Philosophy (double major: Computer Science and Engineering), Sogang University
Jeesoo Kim (김지수)
Sep. 2025 ~: M.S. Program in Graduate School of Data Science
Aug. 2025: B.A. in Business Administration (double major: Computer Science and Engineering), Sogang University
Jeemin Kim (김지민)
Mar. 2026 ~: M.S. Program in School of Computing
Feb. 2026: B.S. in Computing (minor: Bio and Brain Engineering), KAIST
Seheon Kim (김세헌)
Mar. 2026 ~: M.S. Program in School of Computing
Feb. 2026: B.S. in Computing, KAIST
Alumni
Jaejun Lee (이재준)
M.S. in Computing, Feb. 2023, KAIST
Thesis: Image-based Augmented Knowledge Graph Embedding
Aug. 2021: B.S. in Computing (double major: Mathematical Sciences), KAIST
Seunghwan Kong (공승환)
M.S. in Computing, Feb. 2023, KAIST
Thesis: Representation Learning on Knowledge Graphs with Entity Types
Aug. 2021: B.S. in Computing (double major: Mathematical Sciences), KAIST
Heehyeon Kim (김희현)
M.S. in Computing, Aug. 2024, KAIST
Thesis: Fraud Detection Using Graph Neural Networks
Aug. 2022: B.S. in IoT Artificial Intelligence Convergence, Chonnam National University
Minsung Hwang (황민성)
M.S. in Computing, Aug. 2024, KAIST
Thesis: Theoretical Generalization Bounds for Knowledge Graph Representation Learning
Feb. 2023: B.S. in Electrical Engineering (minor: Computing), KAIST
Jinhyeok Choi (최진혁)
M.S. in Computing, Feb. 2025, KAIST
Thesis: Spatio-Temporal Graph Forecasting by Modeling Long-Range Dependency via Selective State Spaces
Feb. 2023: B.S. in Computing (minor: Electrical Engineering), KAIST
Junho Park (박준호)
M.S. in Computing, Feb. 2025, KAIST
Thesis: Root Cause Analysis for Microservice Systems with Resource-Sharing Dependencies
Aug. 2021: B.S. in Electrical Engineering and Computer Science, GIST
Kyeongryul Lee (이경률)
M.S. in Data Science, Feb. 2026, KAIST
Thesis: Probing Safety Vulnerabilities in Large Language Models and Multimodal Models via Auto-Generated Jailbreak Prompts
Feb. 2024: B.S. in Data Science (double major: Financial Mathematics & Statistics), The University of Sydney
Minhyeong An (안민형)
M.S. in Computing, Feb. 2026, KAIST
Thesis: Graph-enhanced Retrieval-Augmented Generation for Microservice Root Cause Analysis with Large Language Models
Feb. 2024: B.S. in Computing (double major: Electrical Engineering), KAIST
Selected Publications
International Publications
‡: Equal Contribution, *: Corresponding Author
Beneath the Facade: Probing Safety Vulnerabilities in LLMs via Auto-Generated Jailbreak Prompts
Findings of the Association for Computational Linguistics: EMNLP (Findings of EMNLP)
Structure Is All You Need: Structural Representation Learning on Hyper-Relational Knowledge Graphs
International Conference on Machine Learning (ICML)
Stability and Generalization Capability of Subgraph Reasoning Models for Inductive Knowledge Graph Completion
International Conference on Machine Learning (ICML)
Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks against GNN-Based Fraud Detectors
AAAI Conference on Artificial Intelligence (AAAI)
Unifying Inductive, Cross-Domain, and Multimodal Learning for Robust and Generalizable Recommendation
Multimodal Generative Search and Recommendation (MMGenSR) Workshop at Conference on Information and Knowledge Management (CIKM)
SAIF: A Comprehensive Framework for Evaluating the Risks of Generative AI in the Public Sector
AI for Public Missions (AIPM) Workshop at AAAI Conference on Artificial Intelligence (AAAI)
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning
International Conference on Machine Learning (ICML)
Why So Gullible? Enhancing the Robustness of Retrieval-Augmented Models against Counterfactual Noise
Findings of the Association for Computational Linguistics: NAACL (Findings of NAACL)
SpoT-Mamba: Learning Long-Range Dependency on Spatio-Temporal Graphs with Selective State Spaces
Spatio-Temporal Reasoning and Learning (STRL) Workshop at International Joint Conference on Artificial Intelligence (IJCAI)
VISTA: Visual-Textual Knowledge Graph Representation Learning
Findings of Empirical Methods in Natural Language Processing (Findings of EMNLP)
FinePrompt: Unveiling the Role of Finetuned Inductive Bias on Compositional Reasoning in GPT-4
Findings of Empirical Methods in Natural Language Processing (Findings of EMNLP)
Representation Learning on Hyper-Relational and Numeric Knowledge Graphs with Transformers
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)
InGram: Inductive Knowledge Graph Embedding via Relation Graphs
International Conference on Machine Learning (ICML)
Learning Representations of Bi-level Knowledge Graphs for Reasoning beyond Link Prediction
AAAI Conference on Artificial Intelligence (AAAI)
Dynamic Relation-Attentive Graph Neural Networks for Fraud Detection
Machine Learning on Graphs (MLoG) Workshop at IEEE International Conference on Data Mining (ICDM)
Semantic Grasping via a Knowledge Graph of Robotic Manipulation: A Graph Representation Learning Approach
IEEE Robotics and Automation Letters
HiddenCPG: Large-Scale Vulnerable Clone Detection Using Subgraph Isomorphism of Code Property Graphs
The ACM Web Conference
Knowledge Graph Embedding via Metagraph Learning
International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
Image-based Lifelogging: User Emotion Perspective
Computers, Materials & Continua
Sentiment-based Sub-event Segmentation and Key Photo Selection
Journal of Visual Communication and Image Representation
MEGA: Multi-View Semi-Supervised Clustering of Hypergraphs
International Conference on Very Large Data Bases (VLDB)
Scalable Anti-TrustRank with Qualified Site-level Seeds for Link-based Web Spam Detection
Workshop on CyberSafety: Computational Methods in Online Misbehavior at the Web Conference
Hyperlink Classification via Structured Graph Embedding
International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR)
Non-exhaustive, Overlapping Clustering
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
SmartGrip: Grip Sensing System for Commodity Mobile Devices through Sound Signals
Personal and Ubiquitous Computing
Fast Asynchronous Anti-TrustRank for Web Spam Detection
Workshop on MIS2: Misinformation and Misbehavior Mining on the Web at ACM International Conference on Web Search and Data Mining (WSDM)
Non-Exhaustive, Overlapping Co-Clustering
ACM Conference on Information and Knowledge Management (CIKM)
An Empirical Study of Community Overlap: Ground-truth, Algorithmic Solutions, and Implications
ACM Conference on Information and Knowledge Management (CIKM)
Fast Multiplier Methods to Optimize Non-exhaustive, Overlapping Clustering
SIAM International Conference on Data Mining (SDM)
Overlapping Community Detection Using Neighborhood-Inflated Seed Expansion
IEEE Transactions on Knowledge and Data Engineering (TKDE)
Non-exhaustive, Overlapping Clustering via Low-Rank Semidefinite Programming
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD)
Scalable Data-driven PageRank: Algorithms, System Issues, and Lessons Learned
International European Conference on Parallel and Distributed Computing (Euro-Par)
Non-exhaustive, Overlapping k-means
SIAM International Conference on Data Mining (SDM)
Stochastic Blockmodel with Cluster Overlap, Relevance Selection, and Similarity-Based Smoothing
IEEE International Conference on Data Mining (ICDM)
Overlapping Community Detection Using Seed Set Expansion
ACM Conference on Information and Knowledge Management (CIKM)
Scalable and Memory-Efficient Clustering of Large-Scale Social Networks
IEEE International Conference on Data Mining (ICDM)
Scalable Clustering of Signed Networks using Balance Normalized Cut
ACM Conference on Information and Knowledge Management (CIKM)
Parallel Clustered Low-rank Approximation of Graphs and Its Application to Link Prediction
International Workshop on Languages and Compilers for Parallel Computing (LCPC)
Domestic Papers
Root Cause Analysis for Microservice Systems Using Anomaly Propagation by Resource Sharing (자원 공유에 따른 이상치 전파를 활용한 마이크로서비스 시스템 결함의 근본 원인 분석)
정보과학회논문지 (Journal of KIISE), 2025년 4월
Knowledge Graph Embedding with Entity Type Constraints (개체 유형 정보를 활용한 지식 그래프 임베딩)
정보과학회논문지 (Journal of KIISE), 2022년 9월
Patents
Projects
National Research Foundation of Korea
-
▶
Responsible Multimodal Graph AI (책임 있는 멀티모달 그래프 인공지능)
-
▶
Extendable Graph Representation Learning (확장 가능한 그래프 표현학습)
-
▶
MARS Artificial Intelligence Integrated Research Center (MARS 인공지능 통합연구센터)
-
▶
Semi-Supervised Multi-View Learning with Graphs (다각적 데이터 융합을 통한 그래프기반 준지도 학습)
-
▶
Modeling Information Propagation by Exploiting the Clustering Structure of Massive Social Networks (거대 소셜 네트워크의 클러스터링 구조를 활용한 정보 전파 메커니즘 모델링)
Samsung Electronics
-
▶
AI Agent-based Omni Knowledge Graph Construction and its Applications (AI Agent 기반 Omni Knowledge Graph 구축 및 활용 기술 개발)
-
▶
Knowledge Graph Modeling for Semiconductor Data (반도체 공정 데이터의 다각적 분석을 위한 지식 그래프 모델링)
Institute of Information & communications Technology Planning & Evaluation (IITP)
Kyobo & DPLANEX
-
▶
Multimodal Cross-Domain Recommendation Systems for Personalized Services (개인 맞춤형 서비스를 위한 멀티모달 크로스 도메인 추천 시스템)
-
▶
GNN-based Insurance Fraud Detection (그래프 신경망(GNN) 기반 보험사기 예측 연구)
Telecommunications Technology Association (TTA)
-
▶
생성형 AI의 기술 혁신에 대응 가능한 안정성 평가체계 수립 연구
-
▶
AI 위험 분야별 안전성 평가를 위한 데이터셋 로드맵 구축
DevStack
-
▶
Development of a Technology for Analyzing Root Causes and Proposing Countermeasures for Containerized OpenStack Service’s Failures Using LLM and Knowledge Graphs (거대언어모델 및 지식그래프를 활용하여 컨테이너화된 오픈스택 장애원인 분석 및 대응방안 제시 기술개발)
-
▶
Development of Intelligent Kubernetes Fault Cause Analysis and Response Proposal Method Using LLM and Knowledge Graphs (LLM 및 지식그래프를 활용한 지능형 K8s 장애 원인 분석 및 대응 방안 제시 기술 개발)
Photos
Recruit
Location
KAIST, N1 Building
Prof.: N1, 905
Lab.: N1, 921
Admin.: N1, 904
Professor Joyce Jiyoung Whang
Tel
Prof.: 042-350-3584
Lab.: 042-350-7784
Admin.: 042-350-7884