INDEX
Explanations
phrases with "as" followed by a noun or pronoun
phrases that establish roles or functions of entities
New Auto-Interp
Negative Logits
ubb
-0.72
Rub
-0.67
raq
-0.63
ros
-0.60
"}],"
-0.60
ople
-0.60
zza
-0.59
aceae
-0.59
COUR
-0.57
jee
-0.57
POSITIVE LOGITS
opposed
1.11
well
1.01
pires
0.93
pired
0.92
part
0.90
follows
0.89
evidenced
0.86
piring
0.84
part
0.84
well
0.83
Activations Density 0.138%