INDEX
Explanations
instances of the word "in" followed by another word
instances of the phrase "sitting in."
New Auto-Interp
Negative Logits
ŀ
-0.68
endors
-0.64
aler
-0.64
elf
-0.62
linger
-0.59
NOW
-0.58
llor
-0.58
lihood
-0.58
killed
-0.54
purch
-0.54
POSITIVE LOGITS
ordinate
1.15
accordance
1.02
offensive
1.01
animate
0.99
situ
0.99
lieu
0.98
front
0.97
limbo
0.95
between
0.95
roads
0.94
Activations Density 0.225%