INDEX
Explanations
even attempting, asking, discussing
New Auto-Interp
Negative Logits
дії
0.43
suddenly
0.43
постоянно
0.39
awsze
0.38
hton
0.38
continuamente
0.38
എല്ല
0.38
ien
0.37
athor
0.37
sudden
0.37
POSITIVE LOGITS
siquiera
0.84
even
0.79
даже
0.71
навіть
0.66
mention
0.66
nawet
0.64
mere
0.63
even
0.63
bahkan
0.63
EVEN
0.62
Activations Density 0.032%