INDEX
Explanations
instances of the preposition "in" followed by other words
the presence of the phrase "in" followed by contextually relevant information
Negative Logits
bos         -0.74
irl         -0.71
LOG         -0.70
BIL         -0.70
hester      -0.69
Attempts    -0.68
hov         -0.68
ENCE        -0.68
alion       -0.66
gans        -0.66
Positive Logits
clusions     1.19
animate      1.09
versions     1.04
lieu         0.99
relation     0.96
between      0.95
organic      0.94
appropriate  0.94
effic        0.93
totality     0.91
Activations Density 0.351%