INDEX
Explanations
words related to people in specific roles or positions
words related to students and participants in various contexts
New Auto-Interp
Negative Logits
cer
-0.71
Kuala
-0.68
rawdownloadcloneembedreportprint
-0.65
OOK
-0.64
Variety
-0.62
tein
-0.62
Thom
-0.61
pour
-0.57
south
-0.57
Kov
-0.57
POSITIVE LOGITS
selves
0.98
counterparts
0.94
folk
0.85
issance
0.85
mates
0.81
brethren
0.80
hip
0.79
ervative
0.78
holdings
0.69
iphate
0.68
Activations Density 0.151%