INDEX
Explanations
references to urgency or immediate actions
Negative Logits
다면          -0.16
à¯įà®        -0.15
alim          -0.15
meg           -0.15
ught          -0.15
времен        -0.15
istrovství    -0.14
именно        -0.14
ultimately    -0.14
soever        -0.14
Positive Logits
aneously      0.35
aneous        0.26
grat          0.23
upon          0.20
-release      0.19
vicinity      0.18
iations       0.18
ately         0.17
onset         0.17
olarak        0.16
Activations Density 0.016%
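
A density figure like the one above is usually read as the share of token positions on which the feature fires. The sketch below shows one way such a number could be computed. It assumes that "fires" means an activation strictly above a threshold of 0; the function name `activation_density` and the sample data are hypothetical and are not taken from the dashboard.

```python
from typing import Iterable


def activation_density(activations: Iterable[float], threshold: float = 0.0) -> float:
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: a feature counts as active when its activation is strictly
    greater than `threshold` (0.0 by default).
    """
    values = list(activations)
    if not values:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in values if a > threshold)
    return active / len(values)


# Hypothetical data: 100,000 token positions, 16 of which are active.
sample = [1.5] * 16 + [0.0] * (100_000 - 16)
density = activation_density(sample)
print(f"{density:.3%}")  # prints "0.016%"
```

A density this low means the feature is silent on nearly all tokens, which is typical of a sparse feature.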