INDEX
Explanations
references to family-run businesses and dog breeds
New Auto-Interp
Negative Logits
yp
-0.16
reesome
-0.15
ÙijÙİ
-0.14
çĺ
-0.14
nghiá»ĩp
-0.14
kowski
-0.14
yny
-0.14
.respond
-0.13
ãģ£ãģ
-0.13
sooner
-0.13
POSITIVE LOGITS
now
0.24
_now
0.21
now
0.21
still
0.19
ahora
0.19
today
0.19
still
0.18
STILL
0.18
today
0.17
Now
0.17
Activations Density 0.180%