INDEX
Explanations
phrases related to self-improvement and personal growth
Negative Logits
Token     Logit
urus      -0.50
נוּ        -0.47
Bland     -0.46
tgärder   -0.45
kam       -0.45
**/       -0.45
glGen     -0.45
lemb      -0.42
demik     -0.42
ناد       -0.41
Positive Logits
Token      Logit
learning   0.81
Valuable   0.77
learnings  0.76
valuable   0.73
learning   0.73
valuable   0.72
learn      0.70
learns     0.70
Learning   0.70
Learning   0.70
Activations Density: 0.385%