INDEX
Explanations
words related to negative actions or qualities, such as discrediting, distaste, or leniency
key terms related to judgment and evaluation
New Auto-Interp
Negative Logits
Kit
-0.87
Kits
-0.80
BI
-0.77
®
-0.77
meg
-0.75
Helic
-0.74
Cu
-0.73
iP
-0.70
TECH
-0.70
HS
-0.69
POSITIVE LOGITS
itionally
1.02
ested
1.01
animous
0.99
unction
0.97
irable
0.94
eful
0.94
empt
0.93
atically
0.93
iless
0.91
iencies
0.91
Activations Density 0.308%