INDEX
Explanations
the presence of the verb "is" indicating statements or definitions
New Auto-Interp
Negative Logits
371
-0.15
Pas
-0.14
Zimmerman
-0.14
sett
-0.14
settled
-0.14
illo
-0.14
Jo
-0.14
third
-0.14
Sol
-0.14
Weinstein
-0.14
POSITIVE LOGITS
ames
0.17
ifar
0.16
ersh
0.16
ormsg
0.16
/browse
0.15
acos
0.15
mates
0.15
mate
0.15
,...↵↵
0.14
abl
0.14
Activations Density 0.038%