Explanations
phrases indicating examples or instances of something
instances of the word "For" indicating examples or illustrations
Negative Logits
è¦ļéĨĴ        -0.71
marine        -0.66
arest         -0.64
zona          -0.61
Introduced    -0.60
æĺ¯           -0.58
smanship      -0.58
itiz          -0.56
BP            -0.56
ucl           -0.55
Positive Logits
gotten        1.34
cing          1.32
example       1.27
bidden        1.25
instance      1.20
ced           1.14
starters      1.08
give          1.06
getting       0.94
comparison    0.92
Activations Density 0.071%