Skip to content
@HiThink-Research

HiThink Research

HiThink-Research

Popular repositories Loading

  1. BizFinBench BizFinBench Public

    A Business-Driven Real-World Financial Benchmark for Evaluating LLMs

    Python 217 8

  2. MME-Finance MME-Finance Public

    [MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning

    Python 43 4

  3. GAGE GAGE Public

    General AI evaluation and Gauge Engine. A unified evaluation engine for LLMs, MLLMs, audio, and diffusion models.

    Python 28 5

  4. BizFinBench.v2 BizFinBench.v2 Public

    BizFinBench.v2: A Unified Offline–Online Bilingual Benchmark for Expert-Level Financial Capability Evaluation of LLMs

    Python 17 1

  5. FinMTM FinMTM Public

    FinMTM: A Multi-Turn Multimodal Benchmark for Financial Reasoning and Agent Evaluation

    Python 15

  6. PuzzleClone PuzzleClone Public

    PuzzleClone: An SMT-Powered Framework for Synthesizing Verified Mathematical Reasoning Data

    Python 5

Repositories

Showing 10 of 10 repositories

Top languages

Loading…

Most used topics

Loading…