INDEX
Explanations
instances of past tense verbs ending in '-ed'
words related to complex or intricate concepts
New Auto-Interp
Negative Logits
imately
-0.85
theless
-0.83
ifully
-0.74
wise
-0.70
cipled
-0.68
channelAvailability
-0.66
fully
-0.65
ificantly
-0.65
trained
-0.63
silenced
-0.63
POSITIVE LOGITS
stros
0.93
unker
0.91
otypes
0.85
agonist
0.80
aution
0.80
rawler
0.79
izen
0.79
eatures
0.78
inter
0.78
ector
0.77
Activations Density 0.343%