Explanations
numerical values and symbols
Negative Logits
Token      Logit
houſe      -0.88
greateſt   -0.87
purpoſe    -0.85
xenia      -0.85
ſta        -0.84
sério      -0.84
NSCoder    -0.81
Efq        -0.81
ſame       -0.80
ſtate      -0.79
Positive Logits
Token      Logit
(          0.47
tens       0.40
law        0.39
ことなく   0.38
Trans      0.38
hite       0.37
dailymail  0.37
attending  0.37
trans      0.36
trans      0.36
Activations Density: 0.018%