INDEX
Explanations
phrases indicating a problem or issue with something
phrases questioning the existence of problems or issues
New Auto-Interp
Negative Logits
mart
-0.88
swick
-0.76
eport
-0.67
lehem
-0.65
Pesh
-0.64
pain
-0.63
onder
-0.63
grown
-0.62
izational
-0.62
unden
-0.61
POSITIVE LOGITS
regard
0.87
regards
0.80
ĪĴ
0.78
respect
0.71
electrons
0.65
Revenge
0.65
aroo
0.64
dignity
0.62
otin
0.62
Ĥª
0.62
Activations Density 0.054%