INDEX
Explanations
names of political figures
punctuation, specifically commas
New Auto-Interp
Negative Logits
nih
-0.71
grain
-0.66
ood
-0.65
continental
-0.65
amorph
-0.64
interstitial
-0.63
eries
-0.61
Availability
-0.61
idepress
-0.60
nightmares
-0.60
POSITIVE LOGITS
meanwhile
1.21
however
0.93
unsurprisingly
0.85
huh
0.84
pictured
0.84
Bullets
0.80
citing
0.78
meantime
0.73
moreover
0.72
Sr
0.70
Activations Density 0.344%