Jianing Zhu | Researcher Profile | Sotabase

Career

· Audit Intern, ALLEN & COOK, INC. CERTIFIED PUBLIC2025–2025

· MSAA Student in Accounting and Analytics, San José State University2025–2026

· Postdoctoral Researcher, UT Austin2025–

Publications (24)

Geometry-aware Instance-reweighted Adversarial Training

International Conference on Learning Representations · 2020

309

cited

DeepInception: Hypnotize Large Language Model to Be Jailbreaker

arXiv.org · 2023

305

cited

Reliable Adversarial Distillation with Unreliable Teachers

International Conference on Learning Representations · 2021

cited

Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation

Neural Information Processing Systems · 2023

cited

Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?

Neural Information Processing Systems · 2024

cited

Understanding the Interaction of Adversarial Training with Noisy Labels

arXiv.org · 2021

cited

Towards Effective Evaluations and Comparisons for LLM Unlearning Methods

International Conference on Learning Representations · 2024

cited

Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability

International Conference on Machine Learning · 2023

cited

Model Inversion Attacks: A Survey of Approaches and Countermeasures

arXiv.org · 2024

cited

Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection

Neural Information Processing Systems · 2024

cited

Adversarial Training with Complementary Labels: On the Benefit of Gradually Informative Attacks

Neural Information Processing Systems · 2022

cited

What If the Input is Expanded in OOD Detection?

Neural Information Processing Systems · 2024

cited

Combating Exacerbated Heterogeneity for Robust Models in Federated Learning

International Conference on Learning Representations · 2023

cited

Decoupling the Class Label and the Target Concept in Machine Unlearning

arXiv.org · 2024

cited

Co-Reward: Self-supervised Reinforcement Learning for Large Language Model Reasoning via Contrastive Agreement

arXiv.org · 2025

cited

Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models

2025

cited

C AN L ARGE L ANGUAGE M ODELS R EASON R OBUSTLY WITH N OISY R ATIONALES ?

Exploring Model Dynamics for Accumulative Poisoning Discovery

International Conference on Machine Learning · 2023

Jailbreak Large Vision-Language Models Through Multi-Modal Linkage Anonymous ACL submission

LiteLMGuard : Seamless and Lightweight On-Device Guardrails for Small Language Models against Quantization Vulnerabilities

Sotabase