INDEX
Explanations
key improvements and explanations
New Auto-Interp
Negative Logits
jīn
0.36
summons
0.33
vó
0.32
notor
0.32
सँग
0.31
undertakings
0.31
collars
0.31
embry
0.30
humoral
0.30
semn
0.30
POSITIVE LOGITS
for
0.51
if
0.47
three
0.46
four
0.44
var
0.44
This
0.43
warm
0.41
8
0.41
other
0.41
this
0.41
Activations Density 0.081%