INDEX
Explanations
references to frustration or difficulty in achieving tasks
New Auto-Interp
Negative Logits
uld
-0.17
ekim
-0.17
ubo
-0.15
idor
-0.15
period
-0.14
peu
-0.14
ame
-0.14
ome
-0.14
éľĩ
-0.14
пеÑĢиод
-0.13
POSITIVE LOGITS
forever
0.49
ages
0.49
ages
0.39
Ages
0.38
FORE
0.37
Forever
0.37
Forever
0.34
AGES
0.33
fore
0.31
age
0.28
Activations Density 0.097%