INDEX
Explanations
topics related to government departments and public health issues
New Auto-Interp
Head Attr Weights
0:0.03
1:0.01
2:0.09
3:0.05
4:0.07
5:0.03
6:0.12
7:0.32
8:0.05
9:0.03
10:0.08
11:0.06
Negative Logits
proof
-1.61
Niet
-1.57
crowds
-1.52
virtues
-1.51
compliments
-1.50
submission
-1.50
ingen
-1.48
listeners
-1.46
giveaway
-1.45
Redditor
-1.45
POSITIVE LOGITS
Agriculture
1.71
arent
1.66
ederal
1.65
arte
1.63
ibur
1.55
Sciences
1.52
Esp
1.45
�醒
1.45
Aerospace
1.45
roleum
1.45
Activations Density 0.001%