INDEX
Explanations
phrases related to actions or behaviors being done by a specific subject
phrases related to media criticism and accountability
New Auto-Interp
Negative Logits
ometer
-0.75
é»Ĵ
-0.73
externalActionCode
-0.71
hiba
-0.71
ŃĶ
-0.67
isl
-0.67
neau
-0.66
ģ«
-0.66
wagen
-0.66
EPA
-0.66
POSITIVE LOGITS
itself
1.02
its
0.91
Its
0.87
me
0.84
us
0.84
dudes
0.83
ITS
0.79
gays
0.76
kids
0.74
thugs
0.74
Activations Density 0.725%