INDEX
Explanations
verbs related to moving or changing states of being
Negative Logits
いな  -0.49
Lord  -0.48
estekak  -0.47
rolla  -0.45
“  -0.44
am  -0.44
되었  -0.44
«  -0.43
Geographie  -0.42
der  -0.42
Positive Logits
विश्वसनीयता  0.91
occafion  0.80
Shakspeare  0.71
Cister  0.70
batore  0.69
RegressionTest  0.69
seamnă  0.68
chofe  0.67
Antilles  0.66
poffe  0.66
Activations Density: 0.307%