INDEX
Explanations
sentences related to information about various topics, such as surveillance, medical practices, politics, technology, and social issues
references to specific legislative or policy issues
New Auto-Interp
Head Attr Weights
0:0.12
1:0.01
2:0.06
3:0.10
4:0.03
5:0.13
6:0.04
7:0.07
8:0.14
9:0.04
10:0.14
11:0.07
Negative Logits
comprises
-0.85
Coconut
-0.83
Bras
-0.79
devote
-0.77
ITAL
-0.77
onwards
-0.76
Mehran
-0.76
Grac
-0.75
Esc
-0.75
�
-0.74
POSITIVE LOGITS
igel
1.01
vil
0.99
areth
0.94
asel
0.89
predec
0.89
uli
0.88
dyn
0.88
gam
0.88
imeo
0.87
ramid
0.87
Activations Density 0.199%