INDEX
Explanations
occurrences of the word "it" and variations of "is" in different contexts
New Auto-Interp
Negative Logits
haust
-0.15
hand
-0.14
Ùĩ
-0.14
esson
-0.14
hart
-0.13
712
-0.13
ulture
-0.13
nbsp
-0.13
orden
-0.13
hook
-0.13
POSITIVE LOGITS
iner
0.36
unes
0.26
chy
0.24
self
0.23
inerary
0.23
zelf
0.23
/th
0.21
'll
0.19
SELF
0.19
asca
0.18
Activations Density 0.540%