INDEX
Explanations
words related to political and controversial social topics
New Auto-Interp
Negative Logits
ĸļ
-0.76
PDATE
-0.74
AUD
-0.67
crunch
-0.65
STD
-0.65
NPR
-0.64
actionDate
-0.63
âĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪâĸĪ
-0.63
Lear
-0.63
fortunate
-0.62
POSITIVE LOGITS
phia
1.02
tein
0.89
ensis
0.88
hire
0.86
Parish
0.84
Palace
0.84
utics
0.83
®
0.81
oulos
0.77
pta
0.76
Activations Density 2.549%