LLMs


StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Friend or Foe? Language as an ideological switch in open-weight LLMs under Russian disinformation stress

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Personalization Meets Safety: Mechanisms, Risks, and Mitigations in Personalized LLMs

 🔗LLM Workflows  Content type: Academic
arxiv.org·

Imbuing Large Language Models with Bidirectional Logic for Robust Chain Repair

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·

The Amplifying Mirror: Locating and Steering the Partisan Direction inside a Large Language Model

 📚RAG  Content type: Academic
arxiv.org·

Long Live Fine-Tuning: Task-Specific Transformers Outperform Zero-Shot LLMs for Misinformation Response Classification on Reddit

 ⚙️MLOps  Content type: Academic
arxiv.org·

Breaking the Tokenizer Barrier: On-Policy Distillation across Model Families

 ⚙️MLOps  Content type: Academic
arxiv.org·

A Regret Minimization Framework on Preference Learning in Large Language Models

 📚RAG  Content type: Academic
arxiv.org·

BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

 ⚙️MLOps  Content type: Academic
arxiv.org·

Lost in the Non-convex Loss Landscape: How to Fine-tune the Large Time Series Model?

 ⚙️MLOps  Content type: Academic
arxiv.org·

Towards Long-Horizon Vessel Trajectory and Destination Forecasting with Reasoning Large Language Models

 🤖AI Coding  Content type: Academic
arxiv.org·

Parameter-Efficient Fine-Tuning with Learnable Rank

 ⚙️MLOps  Content type: Academic
arxiv.org·

Korean Culture into LLM Alignment: Toward Cultural Coherence

 🏗️Data Engineering  Content type: Academic
arxiv.org·

Chiseling Out Efficiency: Structured Skeleton Supervision for Efficient Code Generation

 🤖AI Coding  Content type: Academic
arxiv.org·

Statistically Reliable LLM-Based Ranking Evaluation via Prediction-Powered Inference

 ⚙️MLOps  Content type: Academic
arxiv.org·

A Unifying Lens on Reward Uncertainty in RLHF

 ⚙️MLOps  Content type: Academic
arxiv.org·

PEFT of SLM for Telecommunications Customer Support: A Comparative Study of LoRA Configurations with Energy Consumption Analysis

 ⚙️MLOps  Content type: Academic
arxiv.org·

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

 ⚙️MLOps  Content type: Academic
arxiv.org·

Revisiting Vul-RAG: Reproducibility and Replicability of RAG-based Vulnerability Detection with Open-Weight Models

 📚RAG  Content type: Academic
arxiv.org·

TICoder: A Repository-Level Code Generation Framework with Test-Driven Planning and Implementation-Aware Reuse

 ✍️Prompt Engineering  Content type: Academic
arxiv.org·
