INDEX
Explanations
mentions of specific policy schemes or programs
mentions of various schemes or plans
New Auto-Interp
Negative Logits
Flo
-0.68
ãģĹ
-0.64
inas
-0.64
azines
-0.63
Nob
-0.62
Brah
-0.62
Qiao
-0.59
Flo
-0.58
Clover
-0.57
lean
-0.56
POSITIVE LOGITS
schemes
1.09
scheme
1.03
etary
0.90
devised
0.86
eers
0.83
ĸļ
0.80
eering
0.77
udeb
0.76
ulence
0.76
Scheme
0.71
Activations Density 0.015%