INDEX
Explanations
phrases indicating composition or construction
phrases that describe composition or structure
Negative Logits
ira       -0.74
AIDS      -0.74
uer       -0.72
lly       -0.72
aura      -0.70
mb        -0.70
Sport     -0.70
Answer    -0.70
hr        -0.68
abba      -0.67
Positive Logits
several    0.99
disparate  0.98
varying    0.97
multiple   0.97
three      0.95
four       0.95
various    0.93
five       0.93
seven      0.90
two        0.90
Activations Density: 0.108%