INDEX
Explanations
political statements or opinions
the phrase "for" followed by various contexts or examples
New Auto-Interp
Negative Logits
è¦ļéĨĴ
-0.74
marine
-0.68
zona
-0.63
(=
-0.63
Book
-0.60
aukee
-0.59
istine
-0.59
omnia
-0.58
halla
-0.57
ãĥIJ
-0.57
POSITIVE LOGITS
example
1.45
instance
1.44
cing
1.39
starters
1.38
ced
1.26
gotten
1.20
give
1.19
bes
1.12
decades
1.12
bidden
1.11
Activations Density 0.057%