INDEX
Explanations
phrases related to support or endorsement
New Auto-Interp
Negative Logits
hran
-0.73
Course
-0.69
ptin
-0.65
textures
-0.64
vu
-0.63
resign
-0.62
scram
-0.61
NetMessage
-0.61
ultane
-0.60
auc
-0.60
POSITIVE LOGITS
TODAY
0.77
ESE
0.68
Patreon
0.67
LW
0.66
Spons
0.66
Supported
0.65
MSN
0.64
ory
0.64
BBC
0.64
fund
0.63
Activations Density 0.041%