INDEX
Explanations
instances of the letter 'b'
New Auto-Interp
Negative Logits
isman
-0.18
اتÙĩ
-0.16
ully
-0.15
ONGL
-0.15
оÑĢÑĤÑĥ
-0.14
ers
-0.14
ores
-0.14
eds
-0.14
gnore
-0.14
io
-0.14
POSITIVE LOGITS
ingham
0.18
avin
0.17
feld
0.15
ecn
0.15
Gardner
0.14
λί
0.14
Hogan
0.14
tas
0.14
ttp
0.14
ÑĥÑĤ
0.13
Activations Density 0.035%