INDEX
Explanations
the start of new sections of text
Technical/academic document excerpts
New Auto-Interp
Negative Logits
contentLoaded
-0.52
ilan
-0.50
://$
-0.48
pas
-0.46
<bos>
-0.46
AutoScaleMode
-0.44
Walkover
-0.43
<>",
-0.43
naman
-0.43
rena
-0.43
POSITIVE LOGITS
myſelf
0.54
Theſe
0.52
Monfieur
0.52
RUnlock
0.50
uttavia
0.48
tanleria
0.48
Cæsar
0.47
клопе
0.47
Jefus
0.46
becauſe
0.46
Activations Density 0.632%