Explanations
references to habits and routines
Negative Logits
Token           Logit
)(((            -0.78
SpringRunner    -0.77
imab            -0.64
реа             -0.62
WriteLiteral    -0.60
Gub             -0.60
candidatura     -0.60
ildi            -0.60
لينك            -0.59
שוליים          -0.59
Positive Logits
Token           Logit
habits          1.41
Habits          1.39
Habit           1.30
habit           1.27
Habit           1.23
habitudes       1.20
tradition       1.05
hábito          0.95
habit           0.95
Tradition       0.95
Activations Density: 0.157%