HugGAN Community

non-profit

AI & ML interests

GANs!

Recent Activity

nielsr submitted a paper about 10 hours ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

gigant authored a paper 5 days ago

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

gigant submitted a paper 6 days ago

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

submitted a paper to Daily Papers about 10 hours ago

Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

Paper • 2605.27295 • Published 2 days ago • 5

authored a paper 5 days ago

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

Paper • 2604.27263 • Published 14 days ago • 10

submitted a paper to Daily Papers 6 days ago

Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation

Paper • 2604.27263 • Published 14 days ago • 10

submitted a paper to Daily Papers 6 days ago

Stable Audio 3

Paper • 2605.17991 • Published 10 days ago • 15

posted an update 7 days ago

One prompt, three answers - which model is from where?

johko/llm-blind-date

I built a little demo where you give three models (Apertus, Llama, Qwen3) the same prompt, and at the end you have to guess which is which based only on their answers.

Give it a try! ;)

authored a paper 13 days ago

Efficient Pre-Training with Token Superposition

Paper • 2605.06546 • Published 21 days ago • 45

submitted 2 papers to Daily Papers about 1 month ago

Scaling Test-Time Compute for Agentic Coding

Paper • 2604.16529 • Published Apr 16 • 12

Geometric Context Transformer for Streaming 3D Reconstruction

Paper • 2604.14141 • Published Apr 15 • 21

authored a paper about 1 month ago

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

Paper • 2604.05083 • Published Apr 6

submitted 2 papers to Daily Papers about 2 months ago

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Paper • 2604.04913 • Published Apr 6 • 12

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Paper • 2603.28130 • Published Mar 30 • 11

authored a paper 2 months ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

submitted a paper to Daily Papers 2 months ago

Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders

Paper • 2603.19209 • Published Mar 19 • 6

authored a paper 2 months ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

submitted a paper to Daily Papers 2 months ago

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

Paper • 2603.19017 • Published Mar 19 • 3

submitted a paper to Daily Papers 2 months ago

V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning

Paper • 2603.14482 • Published Mar 15 • 36

submitted a paper to Daily Papers 2 months ago

Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA

Paper • 2603.08501 • Published Mar 9

submitted a paper to Daily Papers 2 months ago

Omnilingual MT: Machine Translation for 1,600 Languages

Paper • 2603.16309 • Published Mar 17 • 23

authored a paper 3 months ago

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65

submitted a paper to Daily Papers 3 months ago

VidEoMT: Your ViT is Secretly Also a Video Segmentation Model

Paper • 2602.17807 • Published Feb 19 • 7