INDEX
Explanations
instances of the word "as" indicating comparisons or descriptions
New Auto-Interp
Negative Logits
aska
-0.64
ASD
-0.59
ese
-0.57
number
-0.57
lot
-0.57
psey
-0.56
english
-0.54
understanding
-0.54
gin
-0.54
idth
-0.53
POSITIVE LOGITS
icipated
0.72
natureconservancy
0.71
opped
0.71
mathemat
0.67
MIT
0.66
rowing
0.65
med
0.63
lighting
0.62
paio
0.61
idates
0.61
Activations Density 0.080%