INDEX
Explanations
proper nouns
occurrences of the word "is"
NEGATIVE LOGITS
stakes       -0.67
thumbnail    -0.64
¥ŀ           -0.63
ruciating    -0.59
downt        -0.58
oats         -0.57
¥µ           -0.55
nick         -0.54
MFT          -0.54
allotted     -0.54
POSITIVE LOGITS
abeth        1.31
earch        1.06
ection       1.05
ystem        1.02
ource        1.02
aurus        1.01
peed         0.99
terness      0.99
kell         0.98
TRY          0.96
Activations Density 0.062%