INDEX
Explanations
phrases related to concealment, solicitation, and controversial topics
terms related to disguise, identity concealment, and social issues
New Auto-Interp
Negative Logits
Kaplan
-0.65
Weston
-0.63
Bellev
-0.61
ObamaCare
-0.60
Kern
-0.59
Ramos
-0.58
Eucl
-0.56
Williamson
-0.56
Sherman
-0.56
Aval
-0.55
POSITIVE LOGITS
depending
1.11
thereof
1.08
alike
0.90
versa
0.87
abouts
0.79
depending
0.79
#$#$
0.78
Else
0.71
peat
0.70
nam
0.69
Activations Density 0.635%