INDEX
Explanations
objects being described in detail
phrases that describe items accompanied by the word "with."
New Auto-Interp
Negative Logits
thren
-0.74
ights
-0.70
hack
-0.70
borgh
-0.67
ati
-0.66
Asia
-0.63
cemic
-0.63
Leaks
-0.63
phrine
-0.63
iance
-0.63
POSITIVE LOGITS
stood
1.53
regard
1.38
regards
1.31
drawn
1.17
respect
1.14
standing
1.06
impunity
1.05
draw
1.05
holding
0.87
whom
0.78
Activations Density 0.174%