Explanations
general knowledge and informational
Negative Logits
1           0.72
vaikut      0.63
privata     0.58
can         0.58
lige        0.58
maximale    0.57
direto      0.57
'           0.57
整个        0.57
Skye        0.56
Positive Logits
General     1.04
umum        1.01
general     1.00
GENERAL     0.98
general     0.97
General     0.88
général     0.87
일반        0.87
eneral      0.86
जनरल        0.84
Activations Density 0.036%