INDEX
Negative Logits
utility
-0.73
utilities
-0.73
surn
-0.72
NetMessage
-0.70
coales
-0.68
conversion
-0.68
resid
-0.67
polyg
-0.66
pheus
-0.64
saline
-0.63
POSITIVE LOGITS
://
1.51
0.94
ONSORED
0.85
youtu
0.85
:/
0.83
gow
0.78
Hur
0.77
books
0.75
www
0.74
HB
0.73
Activations Density 0.007%