Explanations
references to professors
repeated mentions of the title "Professor" followed by names
Negative Logits
Token       Logit
destro      -0.88
queen       -0.84
ãĥ¼ãĥ³      -0.79
cruc        -0.69
fracture    -0.69
leash       -0.68
queens      -0.68
chorus      -0.67
takeoff     -0.66
burning     -0.65
Positive Logits
Token       Logit
essors      1.06
Laure       0.89
Puzz        0.89
ĨĴ          0.88
Emer        0.83
emer        0.82
Michel      0.79
Professor   0.79
umin        0.77
Wolfgang    0.77
Activations Density 0.021%