INDEX
Explanations
statements related to legislation and political discussions
Negative Logits
Bos         -0.71
Romans      -0.69
Freak       -0.69
Crusader    -0.66
Hats        -0.63
Salam       -0.63
Pony        -0.62
Towers      -0.62
Spears      -0.62
Dungeons    -0.61
Positive Logits
rogens      0.90
rogen       0.84
rehabilit   0.83
dissemin    0.79
distribute  0.78
refine      0.77
manipulate  0.77
/+          0.77
circulate   0.76
analyze     0.76
Activations Density: 1.856%