INDEX
Explanations
mentions of aid and assistance
references to humanitarian aid
New Auto-Interp
Negative Logits
Bellev
-0.76
Beard
-0.70
é¾
-0.70
Ran
-0.68
Coat
-0.63
Grind
-0.63
Beau
-0.62
aber
-0.60
Mamm
-0.59
chrom
-0.57
POSITIVE LOGITS
aid
1.16
glers
0.88
Aid
0.86
maid
0.81
giving
0.80
aments
0.78
aids
0.78
lift
0.77
ilitation
0.76
uese
0.76
Activations Density 0.010%