INDEX
Explanations
references to transportation or vehicles, particularly streetcars and trolleys
trolley, car, or dolly
Negative Logits
Token          Logit
is             -0.39
!              -0.39
Fink           -0.37
*              -0.37
<blockquote>   -0.35
...            -0.35
               -0.35
in             -0.35
</blockquote>  -0.34
-              -0.34
Positive Logits
Token       Logit
trolley     2.20
Trolley     2.11
rolley      1.55
trol        1.43
müſſen      1.05
ロウィン    0.98
laſſen      0.93
ainfi       0.91
<unused23>  0.89
majánló     0.89
Activations Density 0.003%