INDEX
Explanations
instances of the verb "is" and its variations across sentences
New Auto-Interp
Negative Logits
hl
-0.17
511
-0.14
dubious
-0.14
иÑģÑĤ
-0.14
arp
-0.14
-in
-0.14
-0.14
ίÏĥ
-0.13
iske
-0.13
rops
-0.13
POSITIVE LOGITS
possible
0.28
possible
0.27
raining
0.24
Possible
0.23
Possible
0.23
posible
0.22
incumbent
0.21
hoped
0.20
impossible
0.20
unclear
0.20
Activations Density 0.302%