INDEX
Explanations
references to the concept of pride, particularly in the context of identity and community
references to the concept of pride, particularly in social and cultural contexts
Negative Logits
Token       Logit
chnology    -0.77
EVA         -0.69
女          -0.67
HELP        -0.67
ica         -0.61
fram        -0.61
iment       -0.61
ites        -0.61
apter       -0.60
ONES        -0.58
Positive Logits
Token       Logit
fully       1.11
ful         1.10
fulness     0.99
FUL         0.94
sac         0.90
hon         0.89
pride       0.84
auld        0.79
ously       0.79
Fest        0.77
Activations Density: 0.029%
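
As a rough illustration of what a density figure like this could mean, the sketch below treats activation density as the fraction of sampled tokens on which the feature fires. That definition, the function name activation_density, the zero threshold, and the toy data are all assumptions made for illustration; they are not taken from this page or from any particular tool.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature is active, with "active" meaning strictly above the threshold.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 3 of 10,000 tokens.
toy = [0.0] * 9997 + [1.2, 0.4, 3.1]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.029% would correspond to the feature firing on roughly 29 of every 100,000 sampled tokens.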