Ivan Evtimov | Researcher Profile | Sotabase

Career

· Store Manager, Jimmy Choo, 2018–2024

· Store Manager, Elie Tahari, 2013–2018

· Store Manager, United Colors of Benetton, 2004–2013

Publications (49)

The Llama 3 Herd of Models

2024

Cited 12,282 times

Seamless: Multilingual Expressive and Streaming Speech Translation

arXiv.org · 2023

Cited 229 times

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

arXiv.org · 2023

Cited 121 times

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks

arXiv.org · 2025

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

North American Chapter of the Association for Computational Linguistics · 2025

Persistent Pre-Training Poisoning of LLMs

International Conference on Learning Representations · 2024

Automated Red Teaming with GOAT: the Generative Offensive Agent Tester

arXiv.org · 2024

AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents

arXiv.org · 2025

SEA-LION: Southeast Asian Languages in One Network

arXiv.org · 2025

2 OLMo 2 Furious (COLM’s Version)

AdvPrefix: An Objective for Nuanced LLM Jailbreaks

arXiv.org · 2024

Gradient-based Jailbreak Images for Multimodal Fusion Models

arXiv.org · 2024

RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection

arXiv.org · 2025

The MultiGEC-2025 Shared Task on Multilingual Grammatical Error Correction at NLP4CALL

Large Reasoning Models Learn Better Alignment from Flawed Thinking

arXiv.org · 2025

LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch

Annual Meeting of the Association for Computational Linguistics · 2024

LogRules: Enhancing Log Analysis Capability of Large Language Models through Rules

North American Chapter of the Association for Computational Linguistics · 2025

Towards Red Teaming in Multimodal and Multilingual Translation

arXiv.org · 2024

Scaling Law for Post-training after Model Pruning

arXiv.org · 2024

From KMMLU-Redux to Pro: A Professional Korean Benchmark Suite for LLM Evaluation

Conference on Empirical Methods in Natural Language Processing · 2025
