benchmark
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Wang and Zhu 2025. "Benchmarking the ancient books capability of multimodal large language models" |
|
0 | 17 | November 15, 2025 |
| Kraus et al. 2025. "A Gold Standard Benchmark Dataset for Digital Humanities" |
|
0 | 7 | September 8, 2025 |