📊 Model Evaluation - flicksinfants1y · Scour

MultiToP: Learning to Patch Visual Tokens to Mitigate Hallucinations in Video Large Multimodal Models

🔬Hallucination Detection Academic

Beyond English benchmarks: clinical llm evaluation in Brazilian Portuguese

🛡LLM safety Academic

GuardNet: Ensemble Strategies of Shallow Neural Networks for Robust Prompt Injection and Jailbreak Detection

🛡LLM safety Academic

Optimizing 2D Input Representations and Sub-phase Fusion Strategies for Differential Diagnosis of Asthma and COPD Using CNN- and GRU-Based Networks

🧬Embeddings Academic

Sample-Efficient LLM-Based Detection of Malicious Web Server Logs with Forensically Explainable Reasoning

✍️Prompt Engineering Academic

Data-Driven Runway and Taxiway Exits Prediction of Landing Aircraft: A Case Study at Hartsfield-Jackson Atlanta International Airport

🎛️Fine-Tuning Academic

Rank Intervals for Leaderboards: A Hierarchical Framework for Model Evaluation

🛡️Red Teaming Academic

UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding

✍️Prompt Engineering Academic

Attention Expansion: Enhancing Keyphrase Extraction from Long Documents with Attention-Augmented Contextualized Embeddings

💭Context Management Academic

Null-Space Constrained Low-Rank Adaptation for Response-Specified Large Language Model Unlearning

🎛️Fine-Tuning Academic

Aggregating LLM-Based Weak Verifiers for Spatial Layout Generation

🎯AI Alignment Academic

MechLens: Late Crystallization of Factual Knowledge Explains Intervention Effectiveness in Language Models

🤖AI Academic

The Order Matters: Sequential Fine-Tuning of LLaMA for Coherent Automated Essay Scoring

🤖AI Academic

Improving Answer Extraction in Context-based Question Answering Systems Using LLMs

✍️Prompt Engineering Academic

FusionVul: A Multimodal Feature Fusion Framework for Source Code Vulnerability Detection

🛡️Red Teaming Academic

Multilingual Detection of Alzheimer's Disease from Speech: A Cross-Linguistic Transfer Learning Approach

🎛️Fine-Tuning Academic

Paediatric-HGNN: A Hybrid Heterogeneous Graph Neural Network for Detecting Disfluency in Children's Speech via Multiscale Acoustic Fusion

🔬Hallucination Detection Academic

ATTAIN: Automated Exploit Failure Analysis through Trace-Driven Diff Analysis

🛡️Red Teaming Academic

Deep Learning-assisted AMD Staging based on OCT and OCT Angiography

🔬Hallucination Detection Academic

Anomaly Detection for Electro-Hydrostatic Actuators using LSTM Autoencoder

🤖AI Academic

Sign up or log in to see more results

Log in to enable infinite scrolling