Explanations
instances of numerical values and dates in the text
Negative Logits
ıza                    -0.08
poil                   -0.08
iju                    -0.08
Ư                      -0.08
имо                    -0.07
shaw                   -0.07
LENG                   -0.07
_________________↵↵    -0.07
başlan                 -0.07
oux                    -0.07
Positive Logits
TAG                    0.07
463                    0.06
825                    0.06
DOI                    0.06
tags                   0.05
Fol                    0.05
contested              0.05
fol                    0.05
467                    0.05
OO                     0.05
Activations Density 0.039%