cs.SE updates on arXiv.org
rss.arxiv.org
Improving MPI Error Detection and Repair with Large Language Models and Bug References
arxiv.org · 4w
A Synthesis Method of Safe Rust Code Based on Pushdown Colored Petri Nets
arxiv.org · 4w
IndustryCode: A Benchmark for Industry Code Generation
arxiv.org · 4w
Evaluating the Formal Reasoning Capabilities of Large Language Models through Chomsky Hierarchy
arxiv.org · 4w
An Initial Exploration of Contrastive Prompt Tuning to Generate Energy-Efficient Code
arxiv.org · 4w
SkillRT: Compiling Skills for Efficient Execution Everywhere
arxiv.org · 4w
DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery
arxiv.org · 4w
UniCon: A Unified System for Efficient Robot Learning Transfers
arxiv.org · 14w
A Survey of Real-Time Support, Analysis, and Advancements in ROS 2
arxiv.org · 15w
Is the Cure Still Worse Than the Disease? Test Overfitting by LLMs in Automated Program Repair
arxiv.org · 23w
An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications
arxiv.org · 31w
Probing Pre-trained Language Models on Code Changes: Insights from ReDef, a High-Confidence Just-in-Time Defect Prediction Dataset
arxiv.org · 33w
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs
arxiv.org · 49w
A Multi-Language Perspective on the Robustness of LLM Code Generation
arxiv.org · 53w
DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation
arxiv.org · 4w
LLMs as Idiomatic Decompilers: Recovering High-Level Code from x86-64 Assembly for Dart
arxiv.org · 4w
Computational Foundations for Strategic Coopetition: Formalizing Sequential Interaction and Reciprocity
arxiv.org · 4w
ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents
arxiv.org · 4w
Evaluation of gNB Monostatic Sensing for UAV Use Case
arxiv.org · 4w
EXHIB: A Benchmark for Realistic and Diverse Evaluation of Function Similarity in the Wild
arxiv.org · 4w