INDEX
Explanations
references to the Bible and its significance in discussions
New Auto-Interp
Negative Logits
ロウィン
-0.79
faſt
-0.79
Infórmanos
-0.78
ſche
-0.77
ſtate
-0.75
ſſung
-0.75
ſch
-0.75
ſont
-0.74
queſta
-0.74
stiefe
-0.73
POSITIVE LOGITS
along
0.56
↵↵
0.43
So
0.43
1
0.43
The
0.42
0.40
Along
0.40
between
0.40
decimos
0.39
Around
0.39
Activations Density 0.377%