INDEX
Explanations
phrases related to significant events or occurrences, particularly in a societal or organizational context
Negative Logits
edith       -0.15
rlen        -0.15
vester      -0.15
using       -0.15
omination   -0.15
raction     -0.15
imedia      -0.14
Genç        -0.14
apiro       -0.14
едж         -0.14
Positive Logits
div         0.16
782         0.16
Div         0.16
Wald        0.15
gir         0.14
[â̦]         0.14
º           0.14
Owner       0.14
Cal         0.14
Mic         0.13
Activations Density: 0.073%