INDEX
Explanations
references to personal experiences and preferences
New Auto-Interp
Negative Logits
assumption
-0.52
dne
-0.50
assumption
-0.47
veden
-0.47
)](
-0.47
doty
-0.46
lesz
-0.46
prés
-0.45
DataAnnotations
-0.44
proven
-0.44
POSITIVE LOGITS
regularly
1.59
frequently
1.51
routinely
1.50
often
1.45
occasionally
1.44
always
1.36
rarely
1.32
sometimes
1.32
usually
1.28
frequently
1.24
Activations Density 0.556%