INDEX
Explanations
numbers within text
numerical or chronological references in a text
Negative Logits
heit        -0.70
boro        -0.64
diseng      -0.63
ardless     -0.62
ocial       -0.60
umni        -0.59
abouts      -0.59
isexual     -0.56
activity    -0.54
assador     -0.53
Positive Logits
Scroll          0.75
However         0.69
Debor           0.68
reditary        0.68
ECK             0.66
Until           0.65
³³³             0.63
Lear            0.63
Specifically    0.63
SEE             0.63
Activations Density: 0.627%