Sotabase
Career
· Store Manager, Jimmy Choo, 2018–2024
· Store Manager, Elie Tahari, 2013–2018
· Store Manager, United Colors of Benetton, 2004–2013
Publications (49)
The Llama 3 Herd of Models
2024
12,282 cited
Seamless: Multilingual Expressive and Streaming Speech Translation
arXiv.org · 2023
229 cited
Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models
arXiv.org · 2023
121 cited
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
arXiv.org · 2025
49 cited
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
North American Chapter of the Association for Computational Linguistics · 2025
36 cited
Persistent Pre-Training Poisoning of LLMs
International Conference on Learning Representations · 2024
35 cited
Automated Red Teaming with GOAT: the Generative Offensive Agent Tester
arXiv.org · 2024
31 cited
AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents
arXiv.org · 2025
27 cited
SEA-LION: Southeast Asian Languages in One Network
arXiv.org · 2025
26 cited
2 OLMo 2 Furious (COLM’s Version)
14 cited
AdvPrefix: An Objective for Nuanced LLM Jailbreaks
arXiv.org · 2024
12 cited
Gradient-based Jailbreak Images for Multimodal Fusion Models
arXiv.org · 2024
7 cited
RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection
arXiv.org · 2025
7 cited
The MultiGEC-2025 Shared Task on Multilingual Grammatical Error Correction at NLP4CALL
7 cited
Large Reasoning Models Learn Better Alignment from Flawed Thinking
arXiv.org · 2025
6 cited
LLäMmlein: Transparent, Compact and Competitive German-Only Language Models from Scratch
Annual Meeting of the Association for Computational Linguistics · 2024
5 cited
LogRules: Enhancing Log Analysis Capability of Large Language Models through Rules
North American Chapter of the Association for Computational Linguistics · 2025
5 cited
Towards Red Teaming in Multimodal and Multilingual Translation
arXiv.org · 2024
4 cited
Scaling Law for Post-training after Model Pruning
arXiv.org · 2024
3 cited
From KMMLU-Redux to Pro: A Professional Korean Benchmark Suite for LLM Evaluation
Conference on Empirical Methods in Natural Language Processing · 2025
2 cited
Ivan Evtimov | Researcher Profile | Sotabase