INDEX
Explanations
adverbs or adjectives ending in 'ly'
adjectives and adverbs related to strength and support
Negative Logits
çīĪ        -0.94
illions    -0.77
inea       -0.77
Duchess    -0.74
£ı         -0.72
士         -0.72
adelphia   -0.70
Deaths     -0.69
ा          -0.68
Millions   -0.67
Positive Logits
(>                 0.74
ambition           0.72
ambitions          0.71
gradient           0.69
defenses           0.69
ellow              0.68
directional        0.68
differentiation    0.68
tendency           0.66
characterization   0.65
Activations Density 0.297%