Explanations
emotional expressions related to help, feeling, and beliefs
Negative Logits
autorytatywna   -0.78
itſelf          -0.73
ainfi           -0.72
Handlung        -0.71
AllAfrica       -0.71
zelve           -0.71
ngOn            -0.68
MediatR         -0.68
@}              -0.68
úrate           -0.68
Positive Logits
inevitable      0.75
inevitably      0.72
inevit          0.61
unavoidable     0.61
inescapable     0.60
Ine             0.57
undoubtedly     0.56
undeniable      0.56
inev            0.55
undeniably      0.54
Activations Density 0.159%