INDEX
Explanations
adjectives and verbs related to negative or critical opinions
adjectives and participles that express quality or condition
Negative Logits
umbn      -0.74
othal     -0.73
ailable   -0.72
anni      -0.72
pei       -0.72
othy      -0.70
______    -0.68
athan     -0.67
obook     -0.66
ajor      -0.65
Positive Logits
observers         0.90
enough            0.90
considerations    0.87
examples          0.85
explanations      0.85
assumptions       0.84
feats             0.83
warnings          0.83
gestures          0.82
quantities        0.82
Activations Density 0.381%