INDEX
Explanations
phrases or words related to evaluations or judgments
terms related to moral and ethical qualifications
New Auto-Interp
Negative Logits
berman
-0.78
Denmark
-0.73
shire
-0.73
LAND
-0.69
avorite
-0.65
Webster
-0.65
hire
-0.64
Sutherland
-0.63
Danish
-0.62
Cumber
-0.62
POSITIVE LOGITS
itatively
1.15
ifications
1.08
ific
0.98
ifying
0.96
ãĥ£
0.93
ifiable
0.93
ifiers
0.92
ifi
0.91
ified
0.90
ities
0.89
Activations Density 0.029%