INDEX
Explanations
phrases related to systems or processes, especially those involving criticism or rigging
New Auto-Interp
Negative Logits
earnest
-0.76
azines
-0.71
tein
-0.70
Emin
-0.66
Dodgers
-0.64
joy
-0.63
Aerial
-0.63
Ranch
-0.62
terday
-0.62
Padres
-0.61
POSITIVE LOGITS
atics
1.09
ically
0.94
wide
0.93
ologies
0.92
overseen
0.85
rigged
0.81
governed
0.80
atically
0.78
opathy
0.78
ifice
0.77
Activations Density 0.090%