INDEX
Explanations
phrases related to value judgments or assessments
conjunctions, particularly the word "and" in various contexts
New Auto-Interp
Negative Logits
hoe
-0.73
ocular
-0.67
ursday
-0.66
emp
-0.65
igraph
-0.61
Wheat
-0.61
orial
-0.61
Lank
-0.60
oid
-0.60
Morning
-0.59
POSITIVE LOGITS
THEN
0.85
expects
0.84
hence
0.82
consequently
0.81
deserve
0.80
therefore
0.78
hopefully
0.77
furthermore
0.76
then
0.76
thus
0.76
Activations Density 0.150%