INDEX
Explanations
instances of the word "the" in various contexts
New Auto-Interp
Negative Logits
vale
-0.17
igon
-0.16
olik
-0.15
اصÙĦ
-0.14
edback
-0.13
ĵ°
-0.13
ius
-0.13
лади
-0.13
arak
-0.13
ystate
-0.13
POSITIVE LOGITS
sake
0.66
purposes
0.58
benefit
0.46
purpose
0.40
duration
0.38
reasons
0.35
Benefit
0.33
duration
0.28
reason
0.28
foreseeable
0.28
Activations Density 0.128%