INDEX
Explanations
references to a specific brand or product
Negative Logits
rolls    -0.15
Zaman    -0.15
cli      -0.15
ning     -0.15
gi       -0.14
oub      -0.14
o        -0.14
raith    -0.14
516      -0.14
out      -0.14
Positive Logits
fo       0.32
Fo       0.28
Fo       0.28
fo       0.25
FO       0.20
resh     0.20
ibles    0.20
obar     0.19
isted    0.19
aming    0.17
Activations Density: 0.015%