INDEX
Explanations
expressions of opinions or recommendations
Negative Logits
my               -0.79
resourceCulture  -0.58
my               -0.57
meiner           -0.55
私に             -0.53
мои              -0.52
私の             -0.52
моя              -0.51
I                -0.51
/**              -0.50
Positive Logits
Eſ               0.75
ſelf             0.75
ourselves        0.74
transfieras      0.67
CanadaChoose     0.66
FTFY             0.64
ſelves           0.63
itſelf           0.62
Zapraszamy       0.61
reaſon           0.60
Activations Density 0.478%