INDEX
Explanations
references to the "Star Wars" franchise
Negative Logits
Token          Logit
pd             -0.71
pine           -0.69
mt             -0.67
certify        -0.67
commit         -0.64
eger           -0.63
fingerprint    -0.62
certified      -0.61
handle         -0.61
body           -0.61
Positive Logits
Token          Logit
Wars           4.13
Wars           2.59
wars           2.21
War            1.63
War            1.53
Warfare        1.52
WAR            1.37
Sith           1.32
Trek           1.25
Rebellion      1.25
Activations Density: 0.018%