INDEX
Explanations
phrases related to being criticized or facing scrutiny
expressions related to being criticized or endangered
New Auto-Interp
Negative Logits
erd
-0.63
depress
-0.63
equ
-0.63
alone
-0.59
ourage
-0.58
olen
-0.56
Camden
-0.56
Lot
-0.55
annis
-0.55
Loaded
-0.55
POSITIVE LOGITS
scrutiny
0.81
ÃĽ
0.72
BAT
0.69
zai
0.69
Ø©
0.68
é¾įå¥ij士
0.67
èª
0.67
士
0.65
Ú
0.64
ÅŁ
0.63
Activations Density 0.068%