INDEX
Explanations
instances of the word "Next" indicating navigation prompts or transitions in an article
New Auto-Interp
Negative Logits
ÏĦε
-0.16
jr
-0.16
ernity
-0.16
तर
-0.14
Mechanics
-0.14
otron
-0.14
onn
-0.14
ylum
-0.13
raci
-0.13
ãĥ
-0.13
POSITIVE LOGITS
še
0.16
bid
0.15
uss
0.15
cko
0.15
aby
0.15
SG
0.15
dma
0.15
ISTA
0.14
anta
0.14
Vita
0.14
Activations Density 0.002%