INDEX
Explanations
references to young people and their experiences
New Auto-Interp
Negative Logits
orius
-0.89
âķIJ
-0.87
pmwiki
-0.84
ãĥ¢
-0.83
ãĥĥ
-0.83
ãĥ¯ãĥ³
-0.81
Cosponsors
-0.77
saf
-0.75
Accessory
-0.75
ãĥĥãĥĪ
-0.75
POSITIVE LOGITS
graduating
0.95
ages
0.88
aged
0.88
aspiring
0.88
graduates
0.86
enroll
0.85
immersed
0.85
aspir
0.82
enrolled
0.81
disillusion
0.80
Activations Density 0.097%