INDEX
Explanations
mentions of specific names or titles
instances of intent or knowledge related to actions and accountability
New Auto-Interp
Negative Logits
LGBT
-0.84
Yeah
-0.80
Fuck
-0.79
Yeah
-0.79
Pretty
-0.78
gay
-0.77
yeah
-0.76
yeah
-0.76
Fuck
-0.74
Pretty
-0.74
POSITIVE LOGITS
sufficient
1.02
cumbers
0.96
sufficiently
0.94
ascertain
0.94
satisf
0.93
spontaneously
0.92
diligent
0.92
ascert
0.91
contempor
0.90
inadvert
0.90
Activations Density 1.581%