INDEX
Explanations
reading comprehension questions and passages
New Auto-Interp
Negative Logits
responsabil
0.39
fopen
0.39
rohk
0.39
honours
0.39
Parrocchia
0.39
পারিত
0.39
wikkel
0.38
navidad
0.38
หลวง
0.38
cud
0.37
POSITIVE LOGITS
questions
0.66
passages
0.65
रीजनिंग
0.61
passage
0.60
Passage
0.59
Questions
0.58
Reasoning
0.55
questions
0.54
section
0.53
प्रश्नों
0.52
Activations Density 0.012%