INDEX
Explanations
mentions of something being 'lesser'
Negative Logits
Pinball       -0.71
lish          -0.67
Unlock        -0.64
lio           -0.63
mberg         -0.61
bal           -0.60
ãĤ§           -0.60
lyn           -0.60
Bo            -0.59
Population    -0.59
Positive Logits
minded        1.01
sized         0.99
than          0.98
worldly       0.83
ones          0.79
iberal        0.79
than          0.78
incarn        0.78
versions      0.77
fractions     0.77
Activations Density: 0.033%