INDEX
Explanations
words related to government assistance programs
references to food stamps and financial support programs
New Auto-Interp
Negative Logits
Stras
-0.77
Ae
-0.74
Beau
-0.73
SOURCE
-0.73
semble
-0.72
Argon
-0.69
IENT
-0.65
Jenner
-0.64
Bucc
-0.64
RIS
-0.63
POSITIVE LOGITS
stamps
1.17
stamp
0.86
pak
0.76
uity
0.75
icket
0.72
stripe
0.71
cards
0.71
eding
0.70
manship
0.70
ishing
0.69
Activations Density 0.011%