INDEX
Explanations
phrases or terms related to abstract concepts and complex social issues
New Auto-Interp
Negative Logits
orn
-0.71
consolation
-0.68
Finnish
-0.67
Belfast
-0.63
Byzantine
-0.62
Georgian
-0.62
constructive
-0.61
Nordic
-0.61
sewing
-0.61
festive
-0.61
POSITIVE LOGITS
would
1.30
should
1.24
was
1.22
has
1.20
could
1.19
might
1.18
had
1.17
were
1.16
will
1.12
doesn
1.07
Activations Density 0.261%