INDEX
Explanations
References to the name "Gib" and related terms or variants
Negative Logits
Token      Logit
umer       -0.15
ingen      -0.15
sắc        -0.15
piler      -0.15
abl        -0.14
ying       -0.14
ynes       -0.14
ID         -0.14
ined       -0.14
rup        -0.14
Positive Logits
Token      Logit
bons       0.27
riel       0.20
ilter      0.17
ault       0.17
707        0.17
untu       0.17
bon        0.17
RARY       0.15
.override  0.15
iyel       0.15
Activations Density: 0.012%