INDEX
Explanations
mentions of saving or saving lives
terms related to saving lives or preventing loss
Negative Logits
interstitial  -0.66
VP            -0.63
FG            -0.63
hole          -0.62
Intern        -0.61
yond          -0.61
Remastered    -0.61
hern          -0.60
Queue         -0.60
KER           -0.60
Positive Logits
Save          0.92
Save          0.77
Lives         0.76
Saving        0.75
Sanctuary     0.74
souls         0.74
idences       0.73
saving        0.73
lives         0.73
luc           0.72
Activations Density 0.031%