INDEX
Explanations
mentions of bicycles
references to bicycles
New Auto-Interp
Negative Logits
arial
-1.03
arios
-0.94
nyder
-0.82
ips
-0.78
ests
-0.77
orial
-0.77
oys
-0.76
essee
-0.76
itia
-0.76
esting
-0.76
POSITIVE LOGITS
bicycle
1.00
puter
0.90
Bicycle
0.84
erg
0.83
©¶æ¥µ
0.79
bicycles
0.77
bicy
0.76
bicycl
0.74
©¶æ
0.74
Friendly
0.73
Activations Density 0.004%