INDEX
Explanations
mentions of the specific word "rows."
mentions of 'rows' or related contexts
New Auto-Interp
Negative Logits
PLAN
-0.69
circumstance
-0.68
natureconservancy
-0.64
incentive
-0.64
Scandinavian
-0.63
Kenyan
-0.61
piece
-0.60
retake
-0.60
disparate
-0.60
miracle
-0.59
POSITIVE LOGITS
rows
1.29
restling
1.04
icz
0.97
olver
0.93
ski
0.91
hip
0.90
lings
0.90
ight
0.88
rowing
0.88
abies
0.87
Activations Density 0.004%