INDEX
Explanations
terms related to fraternity and sorority organizations
New Auto-Interp
Negative Logits
Decomp
-0.17
Professor
-0.14
chers
-0.14
ấu
-0.14
139
-0.13
Kit
-0.13
179
-0.13
burger
-0.13
resher
-0.13
ãĥĥ
-0.13
POSITIVE LOGITS
Rush
0.28
Ritual
0.27
Greek
0.27
rush
0.26
rush
0.25
ritual
0.25
chapter
0.24
rushes
0.24
Greeks
0.24
Greek
0.24
Activations Density 0.017%