INDEX
Explanations
words related to evaluation or appraisal
terms related to approval ratings or evaluations
New Auto-Interp
Negative Logits
ãģĮ
-0.74
SHE
-0.74
Hicks
-0.70
Wi
-0.65
HELP
-0.61
Jessie
-0.60
Rudd
-0.60
chickens
-0.60
additionally
-0.59
feds
-0.58
POSITIVE LOGITS
val
4.51
vals
2.56
VAL
2.09
Val
1.76
val
1.56
va
1.40
vale
1.39
eval
1.35
Val
1.34
valued
1.33
Activations Density 0.007%