INDEX
Explanations
Twitter handles
instances of a specific character or symbol
New Auto-Interp
Negative Logits
ordinate
-0.78
²¾
-0.75
wagen
-0.75
compens
-0.75
ixel
-0.74
ModLoader
-0.72
apers
-0.72
interstitial
-0.71
perate
-0.68
annex
-0.67
POSITIVE LOGITS
————————
1.08
————
0.94
VIDEOS
0.94
————————————————
0.84
avanaugh
0.81
Rabbi
0.80
ï¸ı
0.76
RT
0.76
Jem
0.75
Kimber
0.74
Activations Density 0.046%