INDEX
Explanations
phrases indicating inquiry or observation
words and phrases seeking information
New Auto-Interp
Negative Logits
.
-0.42
,
-0.40
ſelf
-0.39
emperador
-0.37
The
-0.35
The
-0.35
Houſe
-0.35
inconvénients
-0.31
Currently
-0.31
dispositif
-0.31
POSITIVE LOGITS
חיצוניים
0.81
rungsseite
0.78
Geſch
0.73
oprot
0.71
mbgg
0.71
müſſen
0.69
ſſung
0.66
iſchen
0.66
שוליים
0.65
verſch
0.65
Activations Density 0.073%