INDEX
Explanations
social media references and hashtags
New Auto-Interp
Head Attr Weights
0:0.04
1:0.03
2:0.04
3:0.11
4:0.07
5:0.05
6:0.04
7:0.05
8:0.06
9:0.05
10:0.08
11:0.33
Negative Logits
flag
-1.74
andise
-1.73
jam
-1.63
anthem
-1.60
flag
-1.57
76561
-1.54
Flag
-1.52
endars
-1.51
�
-1.50
AAF
-1.50
POSITIVE LOGITS
unemploy
1.75
confinement
1.68
inav
1.59
malnutrition
1.56
levels
1.51
Gord
1.50
eger
1.49
Processing
1.47
Classification
1.45
Gupta
1.44
Activations Density 0.038%