INDEX
Explanations
links starting with "https://t.co/" and including specific character combinations
punctuation marks, specifically periods at the end of statements
New Auto-Interp
Negative Logits
handwriting
-0.72
Speech
-0.64
Haram
-0.61
laborers
-0.58
ecosystems
-0.57
resettlement
-0.57
temperament
-0.56
charms
-0.55
suicides
-0.55
nomine
-0.55
POSITIVE LOGITS
imgur
0.87
0.79
esp
0.77
redd
0.75
nz
0.75
gov
0.75
assetsadobe
0.74
wikipedia
0.73
rev
0.72
co
0.72
Activations Density 0.022%