INDEX
Explanations
young people
references to young people and their experiences or challenges
NEGATIVE LOGITS
ãĥĥ        -0.95
âķIJ        -0.88
ãĥ¯ãĥ³     -0.87
ãĥ¢        -0.87
ãĥĥãĥĪ     -0.86
ãĥ¬        -0.83
ãĥ¼ãĥĨ     -0.79
pmwiki     -0.78
orius      -0.77
ãĤ¨ãĥ«     -0.75
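Several entries in the negative-logit list above (for example `ãĥĥ` and `âķIJ`) look like mojibake, but they are consistent with byte-level BPE token strings, in which every raw byte is shown as a printable stand-in character. The sketch below decodes such strings back to text. It assumes the vocabulary uses the GPT-2 byte-to-unicode table; the source does not name the tokenizer, so that is an assumption, and `bytes_to_unicode`, `CHAR_TO_BYTE`, and `decode_token` are illustrative helper names rather than part of any dashboard API.

```python
def bytes_to_unicode():
    """Rebuild a GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable keep their own code point.
    keep = (list(range(ord("!"), ord("~") + 1))
            + list(range(ord("¡"), ord("¬") + 1))
            + list(range(ord("®"), ord("ÿ") + 1)))
    byte_values = keep[:]
    code_points = keep[:]
    shift = 0
    # Every remaining byte is assigned the next free code point from 256 upward.
    for b in range(256):
        if b not in keep:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> raw byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token):
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    # A token may end in the middle of a multi-byte character, so do not raise.
    return raw.decode("utf-8", errors="replace")


# Token strings copied from the negative-logit list.
tokens = ["ãĥĥ", "âķIJ", "ãĥ¯ãĥ³", "ãĥ¢", "ãĥĥãĥĪ", "ãĥ¬", "ãĥ¼ãĥĨ", "ãĤ¨ãĥ«", "pmwiki"]
for t in tokens:
    print(t, "->", decode_token(t))
```

Under that assumption, `ãĥĥ` corresponds to the bytes E3 83 83, the katakana character ッ, and `âķIJ` to E2 95 90, the box-drawing character ═. Plain ASCII tokens such as `pmwiki` pass through unchanged. If the feature comes from a model with a different tokenizer, the table would need to be replaced accordingly.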
POSITIVE LOGITS
todd         0.78
graduates    0.77
inexper      0.74
emanc        0.70
disillusion  0.69
udi          0.68
aspir        0.66
aspiring     0.64
immersed     0.62
graduating   0.60
Activations Density 0.140%