INDEX
Explanations
specific sections or categories within a larger context
occurrences of the word "section" and its variations
New Auto-Interp
Negative Logits
cia
-0.79
Iv
-0.77
opio
-0.70
Grand
-0.68
ILLE
-0.67
monetary
-0.63
cffff
-0.62
gripping
-0.60
claimed
-0.60
ce
-0.60
POSITIVE LOGITS
sections
0.84
lation
0.82
witz
0.79
enium
0.79
crew
0.74
section
0.73
owell
0.73
bars
0.70
edin
0.70
isions
0.70
Activations Density 0.010%