INDEX
Explanations
phrases related to communication or interaction between individuals
instances of the word "spoken."
Negative Logits
Protect       -0.71
Eastern       -0.65
adj           -0.62
rather        -0.60
nen           -0.59
accompanied   -0.57
Ma            -0.56
abet          -0.55
TRANS         -0.55
NL            -0.55
Positive Logits
nor        1.25
yet        1.07
yet        1.06
anymore    1.04
anywhere   1.00
anything   0.98
anybody    0.94
since      0.90
dime       0.89
any        0.88
Activations Density: 0.214%