INDEX
Explanations
proper nouns or names related to individuals, particularly in a context involving achievements or roles
Negative Logits
Tro        -0.18
Tro        -0.17
Pazar      -0.16
tro        -0.16
tant       -0.16
timespec   -0.14
ropy       -0.14
IVEN       -0.14
oran       -0.14
ún         -0.14
Positive Logits
(AP        0.26
Ap         0.23
ap         0.23
/AP        0.21
AP         0.20
/ap        0.19
.Ap        0.19
(ap        0.18
amiento    0.18
ап         0.17
Activations Density: 0.011%