INDEX
Explanations
words related to development or evolution
instances of the word "dev" in various forms, particularly in relation to development or evolution
New Auto-Interp
Negative Logits
berman
-0.70
Twain
-0.70
terday
-0.69
xual
-0.68
Sapp
-0.66
Rouge
-0.66
Moonlight
-0.66
SHIP
-0.65
Tempest
-0.65
¥µ
-0.64
POSITIVE LOGITS
olved
1.46
irtual
1.43
olution
1.30
iated
1.30
olve
1.27
iates
1.24
iant
1.19
iate
1.19
olving
1.17
iating
1.16
Activations Density 0.026%