Explanations
expressions of personal opinions and reflections on events or choices
Negative Logits
unction    -0.19
elsinki    -0.17
unden      -0.15
opis       -0.15
ohana      -0.15
hed        -0.15
inho       -0.14
persons    -0.14
seems      -0.14
Ñģб        -0.14
Positive Logits
æģ         0.17
angan      0.16
dar        0.16
ayi        0.15
krom       0.14
dư         0.14
δεδο       0.14
ardin      0.14
å¢         0.13
Gram       0.13
Activations Density 0.330%