INDEX
Explanations
references to sacrifice and the impacts of those sacrifices on various aspects of life
New Auto-Interp
Head Attr Weights
0:0.02
1:0.02
2:0.05
3:0.04
4:0.11
5:0.03
6:0.05
7:0.36
8:0.05
9:0.03
10:0.11
11:0.07
Negative Logits
iott
-1.76
Cola
-1.75
INO
-1.67
jong
-1.60
CENT
-1.60
INFO
-1.60
videos
-1.55
xxx
-1.50
akespeare
-1.50
Citation
-1.47
POSITIVE LOGITS
sacrifice
1.98
sacrificed
1.80
Sacrifice
1.80
sacrifices
1.66
virginity
1.65
sacrificing
1.65
subsistence
1.59
scraps
1.57
sovere
1.57
forfeit
1.52
Activations Density 0.008%