INDEX
Explanations
instances of the phrase "Il" or similar structures in the text
New Auto-Interp
Negative Logits
wich
-0.19
warz
-0.16
Abbott
-0.16
Abb
-0.15
abb
-0.15
ardon
-0.15
θι
-0.14
ä¹±
-0.14
иÑĪ
-0.14
ilha
-0.14
POSITIVE LOGITS
á»ijt
0.15
STRICT
0.15
ObjectType
0.15
rane
0.14
NSNotification
0.14
swer
0.14
Waist
0.14
Tyto
0.14
DÃŃky
0.14
æĪ¸
0.13
Activations Density 0.004%