Publications

2026

  1. predict_agents.svg
    Predicting a target agent’s next decision from K prior games. (A) LLM-as-Predictor directly prompts a large LLM for the decision. (B) A text-tabular formulation feeds game features and text representations to a tabular foundation model (TabPFN). (C) Our model adds LLM-as-Observer: the hidden state of a small frozen LLM that reads the game state and dialogue becomes an additional decision-oriented representation.
    Predicting Decisions of AI Agents from Limited Interaction through Text-Tabular Modeling
    Eilam ShapiraMoshe Tennenholtz, and Roi Reichart
    arXiv preprint arXiv:2605.12411, 2026
  2. multabench.svg
    The MulTaBench Curation Pipeline. Datasets are included if joint prediction outperforms unimodal baselines and if Target-Aware Representations improve on frozen, off-the-shelf embeddings.
    MulTaBench: Benchmarking Multimodal Tabular Learning with Text and Image
    Alan Arazi*Eilam Shapira*, Shoham Grunblat, Mor Ventura, Elad Hoffer, Gioia BlayerDavid HolzmüllerLennart PuruckerGaël VaroquauxFrank Hutter+7 more authors, and Roi Reichart
    arXiv preprint arXiv:2605.10616, 2026
  3. STRABLE: Benchmarking Tabular Machine Learning with Strings
    Gioia Blayer, Myung Jun Kim, Félix LefebvreLennart PuruckerAlan Arazi+3 more authorsEilam ShapiraRoi ReichartFrank Hutter, Marine Le MorvanDavid Holzmüller+4 more authors, and Gaël Varoquaux
    arXiv preprint arXiv:2605.12292, 2026
  4. alignment.png
    Pearson correlations of base models and human decisions (x-axis) vs. aligned models and human decisions (y-axis) across four game families. Points below the diagonal indicate base advantage.
    Alignment Makes Language Models Normative, Not Descriptive
    Eilam ShapiraMoshe Tennenholtz, and Roi Reichart
    arXiv preprint arXiv:2603.17218, 2026
  5. tabagent.png
    (a) TabSchema compiles trajectory-derived schema, state, and dependency signals into tabular features. (b) TabHead consumes features and candidates to output calibrated probabilities.
    TabAgent: A Framework for Replacing Agentic Generative Components with Tabular-Textual Classifiers
    Ido Levy*Eilam Shapira*, Yinon Goldshtein, Avi Yaeli, Nir Mashkif+2 more authors, and Segev Shlomov
    arXiv preprint arXiv:2602.16429, 2026
  6. Textual Planning with Explicit Latent Transitions
    Eliezer Shlomi, Ido Levy, Eilam Shapira, Michael Katz, Guy Uziel, Segev Shlomov, Nir MashkifRoi Reichart+5 more authors, and Sarah Keren
    2026
  7. poisoned_apple_effect.png
    Illustration of the "poisoned apple" example, in which Alice increases her payoff at Bob’s expense by releasing a new technology—without the players actually using that technology in practice.
    The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents
    Eilam ShapiraRoi Reichart, and Moshe Tennenholtz
    arXiv preprint arXiv:2601.11496, 2026
  8. From Benchmarks to Business Impact: Deploying IBM Generalist Agent in Enterprise Production
    Segev Shlomov, Alon Oved, Sami Marreed, Ido Levy, Offer Akrabi, Avi Yaeli, Łukasz Strąk, Elizabeth Koumpan, Yinon Goldshtein+7 more authorsEilam Shapira, Nir Mashkif, and Asaf Adi
    Proceedings of the AAAI Conference on Artificial Intelligence, Mar 2026

2025

  1. Donors and Recipients: On Asymmetric Transfer Across Tasks and Languages with Parameter-Efficient Fine-Tuning
    Kajetan Dymkiewicz, Ivan Vulic, Helen Yannakoudakis, Eilam ShapiraRoi Reichart, and Anna Korhonen
    arXiv preprint arXiv:2511.13368, 2025
  2. NeurIPS
    tabstar.png
    The TabSTAR architecture illustrated with our toy dataset. The model processes numerical features, textual features, and all possible target values for classification.
    TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields
    Alan AraziEilam Shapira, and Roi Reichart
    Advances in Neural Information Processing Systems, 2025
  3. Fairness under Competition
    Ronen Gradwohl, Eilam Shapira, and Moshe Tennenholtz
    Advances in Neural Information Processing Systems, 2025
  4. TACL ACL
    persuasion_game.png
    Illustration of a single round in the language-based persuasion game.
    Human Choice Prediction in Language-based Persuasion Games: Simulation-based Off-Policy Evaluation
    Transactions of the Association for Computational Linguistics, Aug 2025
  5. Transition Function Prediction in AI Planning Using LLMs
    Eliezer Shlomi, Guy Azran, Eilam Shapira, Omer NahumRoi Reichart, Guy Uziel, Michael Katz, Ateret Anaby Tavor+5 more authors, and Sarah Keren
    AAAI 2025 Workshop LM4Plan, 2025

2024

  1. glee.png
    GLEE: A Unified Framework and Benchmark for Language-based Economic Environments
    Eilam ShapiraOmer Madmon, Itamar Reinman, Samuel Joseph AmouyalRoi Reichart+2 more authors, and Moshe Tennenholtz
    arXiv preprint arXiv:2410.05254, 2024
  2. JAIR
    canllmreplace.png
    Results for the prediction task introduced in the paper, comparing alternative ways to use data from the 110 human players and from the LLM-generated players.
    Can Large Language Models Replace Economic Choice Prediction Labs? The Case of Language-based Persuasion Games
    Eilam ShapiraOmer MadmonRoi Reichart, and Moshe Tennenholtz
    arXiv preprint arXiv:2401.17435, 2024
  3. Text2Model: Text-based Model Induction for Zero-shot Image Classification
    Ohad Amosy, Tomer Volk, Eilam ShapiraEyal Ben-DavidRoi Reichart+2 more authors, and Gal Chechik
    Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024