Masafumi Oyamada

小山田昌史

Chief Scientist

NEC Corporation

Research Interests

I am a computer scientist based in Tokyo, Japan. I am passionate about maximizing human potential through the power of machines (機械) and knowledge (知識). My current research interests focus on three key areas: developing agentic AI systems that can autonomously interact with their environment 🤖, advancing computer automation to streamline complex workflows ⚡, and exploring methods for LLM self-improvement to enhance AI capabilities 📈.

I am now a Chief Scientist at NEC Corporation, where I lead research initiatives in large language models, data preprocessing automation, and AI systems. Our recent work includes developing novel approaches for LLM self-improvement, optimizing low-resource language model training, and creating intelligent systems for data preprocessing and table understanding. We are particularly focused on making AI systems more efficient, autonomous, and capable of handling complex real-world tasks.

Contact

E-Mail
I welcome opportunities for research collaboration and internships in the areas of AI systems, automation, and language models. If you're interested in working together on cutting-edge AI research, please reach out.

Optimizing Low-Resource Language Model Training: Comprehensive Analysis of Multi-Epoch, Multi-Lingual, and Two-Stage Approaches.

Kosuke Akimoto, Masafumi Oyamada

Preprint(2024)

Masafumi Oyamada

小山田 昌史

Research Interests

Contact

Recent News

Paper Accepted to COLM 2025 🦅

Paper Accepted to ACL 2025 📝

Paper Accepted to SIGIR 2025 🔍

Paper Accepted to BioNLP2025 🏥

Paper Accepted to NAACL 2024 main track 🤖

KeyNote at ComSys 2024 🗣️

Paper Accepted to IEEE Big Data 2024 🤖

Paper Accepted to EMNLP 2024 main track 🪼

Lecture at Kobe University 🎓

Talk at ACM MM 2024 🎯

Invited Talk at WebDB Summer Workshop 🗣️

Tutorial Accepted to CIKM 2024 📊

Talk at IPSJ Seminar 🎤

Paper Accepted to *SEM@NAACL 2024 🎯

Paper Accepted to VLDB 2023 🔍

Paper Accepted to IEEE Big Data 2023 📚

Paper Accepted to EMNLP Findings 2023 🔍

Paper Accepted to PAKDD 2023 🤝

Paper Accepted to SIGIR 2022 📊

Publications

Can a Crow Hatch a Falcon? Lineage Matters in Predicting Large Language Model Performance

An Empirical Study of LLM-as-a-Judge: How Design Choices Impact Evaluation Reliability

LaMDAgent: An Autonomous Framework for Post-Training Pipeline Optimization via LLM Agents

SCAN: Semantic Document Layout Analysis for Textual and Visual Retrieval-Augmented Generation

LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries

Mining Hidden Thoughts from Texts: Evaluating Continual Pretraining with Synthetic Data for LLM Reasoning

On Synthesizing Data for Context Attribution in Question Answering

DISC: Dynamic Decomposition Improves LLM Inference Scaling

Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain

Can Large Language Models Invent Algorithms to Improve Themselves?

Optimizing Low-Resource Language Model Training: Comprehensive Analysis of Multi-Epoch, Multi-Lingual, and Two-Stage Approaches.

Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing.

On the Use of Large Language Models for Table Tasks.

Relevance, Diversity, and Exclusivity: Designing Keyword-augmentation Strategy for Zero-shot Classifiers.

Large Language Models as Data Preprocessors.

LightPAL: Lightweight Passage Retrieval for Open Domain Multi-Document Summarization.

DeepJoin: Joinable Table Discovery with Pre-trained Language Models.

Towards Large Language Model Organization: A Case Study on Abstractive Summarization.

Cross-Domain User Similarity without Overlapping Attributes via Optimal Transport Theory.

Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering.

QA-Matcher: Unsupervised Entity Matching Using a Question Answering Model.

Large Language Models as Data Preprocessors.

Jellyfish: A Large Language Model for Data Preprocessing.

Table Enrichment System for Machine Learning.

Table Enrichment System for Machine Learning.

DeepJoin: Joinable Table Discovery with Pre-trained Language Models.

Continuous top-k spatial-keyword search on dynamic objects.

Low-resource Taxonomy Enrichment with Pretrained Language Models.

Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach.

User Identity Linkage for Different Behavioral Patterns across Domains.

Quality Control for Hierarchical Classification with Incomplete Annotations.

Learning with Unsure Responses.

Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach.

Meimei: An Efficient Probabilistic Approach for Semantically Annotating Tables.

Extracting Feature Engineering Knowledge from Data Science Notebooks.

Compressed Vector Set: A Fast and Space-Efficient Data Mining Framework.

Accelerating Feature Engineering with Adaptive Partial Aggregation Tree.

Relational Mixture of Experts: Explainable Demographics Prediction with Behavioral Data.

Link Prediction for Isolated Nodes in Heterogeneous Network by Topic-Based Co-clustering.

MOARLE: Matrix Operation Accelerator Based on Run-Length Encoding.

Continuous query processing with concurrency control: reading updatable resources consistently.

Efficient Invocation of Transaction Sequences Triggered by Data Streams.

小山田昌史