INDEX
Explanations
issues of importance or concern
phrases expressing importance or significance
New Auto-Interp
Negative Logits
alone
-0.74
ellow
-0.70
aimon
-0.68
gee
-0.68
isoft
-0.64
hew
-0.64
fledged
-0.62
********
-0.62
ip
-0.61
pload
-0.61
POSITIVE LOGITS
hardest
1.54
most
1.50
longest
1.42
greatest
1.41
heaviest
1.40
happiest
1.35
fastest
1.31
quickest
1.31
strongest
1.31
widest
1.30
Activations Density 0.417%