INDEX
Explanations
adjectives or phrases related to a short duration
references to short durations or brief experiences
New Auto-Interp
Negative Logits
ILLE
-0.74
velt
-0.70
itational
-0.70
IRO
-0.68
ICAN
-0.64
KI
-0.64
JV
-0.64
Magikarp
-0.64
IAN
-0.63
Gener
-0.63
POSITIVE LOGITS
short
1.02
sighted
0.95
ening
0.92
ened
0.88
sword
0.84
comings
0.84
stature
0.82
lived
0.81
bread
0.81
leaf
0.80
Activations Density 0.018%