INDEX
Explanations
references to research findings or investigation outcomes
phrases mentioning the results or outcomes of investigations, trials, or elections
New Auto-Interp
Negative Logits
ño
-0.72
ener
-0.72
horn
-0.68
anty
-0.68
Rew
-0.66
GREEN
-0.66
ÃŁ
-0.66
oster
-0.64
ept
-0.64
arten
-0.62
POSITIVE LOGITS
deed
0.65
preliminary
0.63
Roe
0.61
©¶æ
0.60
ãĥĢ
0.59
ursday
0.59
Ĥ¬
0.58
RELE
0.57
disse
0.57
{}0.56
Activations Density 0.157%