INDEX
Explanations
phrases related to safety and health concerns
New Auto-Interp
Negative Logits
↵
-0.50
ar
-0.48
<eos>
-0.47
Alex
-0.45
vi
-0.45
ček
-0.43
rs
-0.43
si
-0.43
correspon
-0.43
vo
-0.42
POSITIVE LOGITS
basicConfig
0.98
itſelf
0.94
Normdatei
0.91
UnsafeEnabled
0.91
Мексичка
0.86
GraphicsUnit
0.86
клопе
0.84
مشين
0.83
expandindo
0.83
FetchType
0.82
Activations Density 0.500%