INDEX
Explanations
references to the character "Il."
New Auto-Interp
Negative Logits
itſelf
-1.05
Monfieur
-1.05
Majefty
-1.04
Anſ
-1.03
Conſ
-1.01
faſt
-0.95
Diſ
-0.91
Chriftian
-0.91
cauſe
-0.90
ſche
-0.90
POSITIVE LOGITS
Il
3.01
il
2.63
Il
2.59
IL
1.62
Ill
1.20
illi
1.10
ll
1.02
İl
1.00
ill
0.98
ILL
0.97
Activations Density 0.049%