INDEX
Explanations
questions related to personal preferences and choices
Negative Logits
σουν        -0.56
σουμε       -0.51
initial     -0.47
susi        -0.46
appeared    -0.45
mişti       -0.44
appeared    -0.44
داشتند      -0.43
ন্          -0.43
increase    -0.42
Positive Logits
daily         1.30
regularly     1.28
routinely     1.14
everyday      1.08
often         1.07
frequently    1.06
daily         1.05
annually      1.05
infrequently  1.04
yearly        1.03
Activations Density: 0.480%