INDEX
Explanations
statements related to legal or medical cases
Negative Logits
Mas    -0.54
Ind    -0.52
ind    -0.51
мо     -0.50
Dou    -0.49
чел    -0.47
зия    -0.47
[\     -0.45
Ind    -0.45
af     -0.45
Positive Logits
tvguidetime     1.06
myſelf          0.95
Мексичка        0.94
Efq             0.93
theſe           0.92
raiſ            0.92
Jefus           0.92
Theſe           0.89
ſeveral         0.87
ResumeLayout    0.87
Activations Density: 0.178%