INDEX
Explanations
references to places or concepts related to bars in various contexts
New Auto-Interp
Negative Logits
y
-0.21
naire
-0.20
arily
-0.19
selling
-0.19
ese
-0.19
seller
-0.18
ester
-0.18
esse
-0.18
erson
-0.17
erver
-0.17
POSITIVE LOGITS
oque
0.27
itone
0.23
bers
0.23
becue
0.22
riers
0.20
celona
0.20
bell
0.18
ivec
0.18
BERS
0.18
codes
0.18
Activations Density 0.056%