Topic cluster

research

20 sources grouped by AFBytes in AI

AFBytes briefing

Optimized GPU kernels can accelerate AI workloads and lower energy consumption in data centers.

Key entities

Abstract
Agents
Language
Models
Reasoning
Large
Large Language Models
Agentic
Benchmark
Diagnostic
LLMs
Neural

AI arxiv.org · Jun 1, 2026 04:00 UTC

GPU Forecasters Using Language Models

The paper investigates language models acting as selective surrogates to forecast and optimize GPU kernel runtimes. The approach seeks to improve performance without exhaustive profiling.

AI arxiv.org · Jun 1, 2026 04:00 UTC

Honesty of LLMs as Bargaining Agents

The study examines how large language models perform as used-car sales agents when information is incomplete. It measures tendencies toward honesty or credulity under varying conditions.

AI arxiv.org · Jun 1, 2026 04:00 UTC

Linguistic Inductive Bias of LLMs for Spatial Reasoning

Researchers characterize the strengths and weaknesses of linguistic inductive biases in large language models when applied to spatial reasoning for navigation. The work identifies specific failure mo…

AI arxiv.org · Jun 1, 2026 04:00 UTC

PithTrain Compact MoE Training System

PithTrain offers a compact, agent-native system for training mixture-of-experts models. The design targets improved efficiency and integration with agent workflows.

AI arxiv.org · Jun 1, 2026 04:00 UTC

Diagnostic Reasoning for Aspect Sentiment Triplet Extraction

The paper proposes fine-grained verification through diagnostic reasoning supervision for aspect sentiment triplet extraction tasks. The method aims to increase reliability of sentiment models on nua…

AI arxiv.org · Jun 1, 2026 04:00 UTC

Training-Free Decoder-Only Attention for Simultaneous Translation

The authors present DOA, a decoder-only attention policy that enables training-free simultaneous translation for long-form speech. The method targets latency reduction while maintaining translation q…

AI arxiv.org · Jun 1, 2026 04:00 UTC

Target-Side Paraphrase Augmentation for Sign Language Translation

The paper proposes using large language models to generate target-side paraphrases that enhance sign language translation systems. This approach aims to address data scarcity and improve model robust…

AI arxiv.org · Jun 1, 2026 04:00 UTC

Neuro-symbolic Syntactic Parsing with CYK Algorithm

The paper describes a method to shape neural network behavior using the classical CYK algorithm within a neuro-symbolic framework. The hybrid approach targets improved parsing accuracy on standard be…

AI arxiv.org · Jun 1, 2026 04:00 UTC

Scaling Conversational Hungarian ASR

The work presents the BEA-Dialogue+ corpus to enable larger-scale training of conversational Hungarian automatic speech recognition systems. The corpus targets gaps in existing Hungarian speech resources.

AI arxiv.org · Jun 1, 2026 04:00 UTC

Skill Availability in Large-Language-Model Agents

The study evaluates how the availability and granularity of skills presented to large language model agents influence task completion. Findings come from systematic experiments on the SkillsBench ben…