INDEX
Explanations
words containing the specific pattern "oured" such as "coloured" and "flavoured"
words related to consumption or preferences
New Auto-Interp
Negative Logits
Title
-0.63
lin
-0.61
theater
-0.60
Sheldon
-0.59
labor
-0.58
charism
-0.58
credential
-0.57
Texas
-0.57
Becker
-0.56
behavioral
-0.56
POSITIVE LOGITS
oured
4.39
ouring
3.17
ours
2.86
our
2.32
OUR
1.82
ored
1.76
ORED
1.63
oring
1.34
iour
1.17
orer
1.14
Activations Density 0.008%