INDEX
Explanations
references to specific geographical locations or economic figures
Head Attribution Weights (head:weight)
0:0.02
1:0.02
2:0.12
3:0.09
4:0.38
5:0.02
6:0.07
7:0.06
8:0.05
9:0.03
10:0.04
11:0.05
Negative Logits
sided      -1.70
ベ         -1.63
iability   -1.61
utations   -1.54
errors     -1.53
advant     -1.51
�          -1.51
effic      -1.48
respons    -1.48
excuses    -1.45
Positive Logits
Others     1.65
Olymp      1.61
Kardash    1.60
Doctors    1.59
others     1.56
ateurs     1.54
Healer     1.51
busters    1.49
Compan     1.49
Huck       1.48
Activations Density 0.013%