INDEX
Explanations
phrases or numbers representing 'two-way' interactions or comparisons
phrases indicating a two-party system or duality in contexts
New Auto-Interp
Negative Logits
Ces
-0.67
Rodrigo
-0.64
imagination
-0.63
Gillespie
-0.62
commentary
-0.61
URRENT
-0.59
olics
-0.59
Gru
-0.59
Canaver
-0.59
Tuls
-0.59
POSITIVE LOGITS
thirds
1.86
dimensional
1.56
legged
1.52
hander
1.43
sided
1.41
dozen
1.38
way
1.36
tier
1.35
faced
1.34
bedroom
1.33
Activations Density 0.038%