INDEX
Explanations
words related to enhancement or improvement
New Auto-Interp
Negative Logits
Leilan
-0.74
HIP
-0.72
Bullets
-0.70
Fired
-0.68
externalToEVAOnly
-0.68
Funk
-0.67
Rampage
-0.67
Nun
-0.66
LOAD
-0.65
Shattered
-0.65
POSITIVE LOGITS
anced
1.71
ancing
1.44
ance
1.17
ancer
1.11
ancers
1.10
ances
1.01
ove
0.98
obbies
0.95
orse
0.95
urst
0.94
Activations Density 0.002%