🧠 LLMs - cwensel · Scour

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

🔗LLM Workflows Academic

Friend or Foe? Language as an ideological switch in open-weight LLMs under Russian disinformation stress

🔗LLM Workflows Academic

Personalization Meets Safety: Mechanisms, Risks, and Mitigations in Personalized LLMs

🔗LLM Workflows Academic

Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair

✍️Prompt Engineering Academic

The Amplifying Mirror: Locating and Steering the Partisan Direction inside a Large Language Model

📚RAG Academic

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

⚙️MLOps Academic

Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families

⚙️MLOps Academic

A Regret Minimization Framework on Preference Learning in Large Language Models

📚RAG Academic

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

⚙️MLOps Academic

Lost in the Non-convex Loss Landscape: How to Fine-tune the Large Time Series Model?

⚙️MLOps Academic

Towards Long-Horizon Vessel Trajectory and Destination Forecasting with Reasoning Large Language Models

🤖AI Coding Academic

Parameter-Efficient Fine-Tuning with Learnable Rank

⚙️MLOps Academic

Korean Culture into LLM Alignment: Toward Cultural Coherence

🏗️Data Engineering Academic

Chiseling Out Efficiency: Structured Skeleton Supervision for Efficient Code Generation

🤖AI Coding Academic

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

⚙️MLOps Academic

A Unifying Lens on Reward Uncertainty in RLHF

⚙️MLOps Academic

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

⚙️MLOps Academic

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

⚙️MLOps Academic

Revisiting Vul-RAG: Reproducibility and Replicability of RAG-based Vulnerability Detection with Open-Weight Models

📚RAG Academic

TICoder: A Repository-Level Code Generation Framework with Test-Driven Planning and Implementation-Aware Reuse

✍️Prompt Engineering Academic
