INDEX
Explanations
blades or blade-related words
references to blades, particularly in the context of tools or weapons
New Auto-Interp
Negative Logits
hett
-0.77
AMA
-0.73
LGBT
-0.72
sitcom
-0.71
Parks
-0.70
Christians
-0.70
Privacy
-0.68
LGBT
-0.67
sit
-0.67
aced
-0.66
POSITIVE LOGITS
blade
3.42
blades
3.24
blade
2.41
Blades
2.20
Blade
2.19
Blade
2.07
sword
1.54
rotor
1.51
lightsaber
1.51
knife
1.46
Activations Density 0.016%