INDEX
Explanations
verbs or phrases indicating preference or support
occurrences of the word "favored" and its variants
New Auto-Interp
Negative Logits
apologise
-0.79
Rehab
-0.78
realise
-0.73
ember
-0.69
Act
-0.69
ecycle
-0.68
netflix
-0.67
analyse
-0.66
iry
-0.64
teness
-0.64
POSITIVE LOGITS
favored
3.85
favoured
3.06
favors
2.16
favoring
2.02
favor
1.89
preferred
1.81
avored
1.75
favorites
1.55
favorable
1.48
prized
1.42
Activations Density 0.022%