FAQ
Home
FAQ
HuggingFace
Why did you create this?
+
How did you pick the types of questions?
+
How did you generate these questions?
+
How are the responses graded?
+
Other benchmarks have hundreds or thousands of questions. Why so few?
+
What other experiments would you run with this dataset, given more time & resources?
+
How did you actually run all these models? I want to see your code.
+