Explanations
scientific classifications or categories in biological contexts
Negative Logits
myſelf      -2.68
Monfieur    -2.65
itſelf      -2.60
Efq         -2.46
pleaſure    -2.35
purpoſe     -2.35
Jefus       -2.32
Anſ         -2.30
Houſe       -2.28
Theſe       -2.27
Positive Logits
1.06
B
0.86
<eos>
0.85
C
0.85
(
0.84
l
0.84
O
0.84
I
0.83
A
0.83
↵↵
0.83
Activations Density: 0.295%