INDEX
Explanations
references to authors, publications, and research studies
references to academic authors and their works
New Auto-Interp
Negative Logits
revenge
-0.94
Rebels
-0.85
franchise
-0.79
venge
-0.75
swing
-0.75
Salvation
-0.75
retaliate
-0.74
lockout
-0.73
Franchise
-0.71
grievance
-0.69
POSITIVE LOGITS
âĢIJ
1.04
et
1.01
resear
0.94
researchers
0.93
earcher
0.92
uscript
0.92
Study
0.90
studies
0.88
doctoral
0.88
PhD
0.86
Activations Density 0.373%