INDEX
Explanations
information or explanations about specific topics or concepts
empty or non-informative text segments
New Auto-Interp
Negative Logits
psc
-0.61
Prime
-0.58
omever
-0.57
ividually
-0.56
sole
-0.56
helicop
-0.56
sic
-0.55
enegger
-0.54
ovan
-0.54
Pg
-0.53
POSITIVE LOGITS
nutshell
0.69
fascinating
0.65
âĢº
0.62
topics
0.62
myths
0.61
misconceptions
0.61
perspectives
0.60
overview
0.59
Difference
0.58
basics
0.58
Activations Density 0.506%