INDEX
Explanations
references to aspirations and goals
New Auto-Interp
Negative Logits
akis
-0.16
chron
-0.14
mushroom
-0.14
dikke
-0.14
wei
-0.14
Chron
-0.14
umb
-0.14
Franklin
-0.14
esa
-0.13
unca
-0.13
POSITIVE LOGITS
/go
0.19
istra
0.15
ÑĸнÑĮ
0.15
Crud
0.15
agna
0.14
èŃ
0.14
0.14
çĸ
0.14
goals
0.14
189
0.14
Activations Density 0.094%