INDEX
Explanations
disruptions to established routines and patterns in daily life
New Auto-Interp
Negative Logits
deaux
-0.18
á»Ń
-0.15
usi
-0.14
grade
-0.14
æĻ´
-0.14
ilon
-0.14
ươi
-0.14
ëĿ½
-0.14
usz
-0.13
enek
-0.13
POSITIVE LOGITS
pattern
0.50
patterns
0.48
Pattern
0.46
Patterns
0.45
Pattern
0.45
pattern
0.44
-pattern
0.41
patterns
0.41
Patterns
0.39
_pattern
0.37
Activations Density 0.020%