INDEX
Explanations
mentions of particular roles that individuals embody or are associated with
references to role models
New Auto-Interp
Negative Logits
yg
-0.69
Gleaming
-0.67
Oo
-0.63
sight
-0.63
Trout
-0.63
eyeb
-0.63
REAM
-0.62
Bulg
-0.62
Brill
-0.61
Uri
-0.61
POSITIVE LOGITS
playing
1.39
reversal
1.03
played
0.97
model
0.94
models
0.90
plays
0.90
players
0.90
play
0.88
Playing
0.87
model
0.87
Activations Density 0.039%