INDEX
Explanations
references to the concept of time and its implications for change or action
New Auto-Interp
Negative Logits
enis
-0.15
avou
-0.14
Ñģло
-0.14
Labels
-0.13
.bio
-0.13
Gee
-0.13
Çİ
-0.13
mur
-0.13
Marble
-0.13
asia
-0.13
POSITIVE LOGITS
times
0.15
-times
0.14
ulo
0.14
Ðļоли
0.14
itol
0.14
ogi
0.14
romium
0.14
šel
0.14
spent
0.13
ÑģоÑĩ
0.13
Activations Density 0.056%