Explanations
any questions or statements
NEGATIVE LOGITS
Token        Value
весьма       0.50
nicht        0.47
ไม่ใช่         0.47
cannot       0.45
excelente    0.45
cannot       0.44
zeer         0.43
មិន          0.43
vrlo         0.43
rất          0.43
POSITIVE LOGITS
Token        Value
আদৌ          0.97
ever         0.92
acaso        0.89
siquiera     0.80
überhaupt    0.73
EVER         0.73
any          0.71
jakieś       0.68
…?           0.68
anything     0.68
Activations Density: 0.074%