INDEX
Explanations
phrases related to discrediting or intimidating actions
phrases related to discrediting and undermining individuals or groups
New Auto-Interp
Negative Logits
Borders
-0.68
Rolls
-0.68
Tok
-0.67
Skies
-0.67
Jets
-0.67
Dungeons
-0.67
Colts
-0.66
Horse
-0.66
Shots
-0.65
Animation
-0.65
POSITIVE LOGITS
rogen
1.12
rogens
1.06
igmat
0.92
dissemin
0.90
rehabilit
0.88
amplify
0.87
eliminate
0.83
analyse
0.79
humili
0.79
defend
0.79
Activations Density 0.141%