INDEX
Explanations
the word "front" in the text
occurrences of the phrase "in front."
New Auto-Interp
Negative Logits
cci
-0.92
zie
-0.82
IRO
-0.75
Rating
-0.73
assets
-0.73
chini
-0.72
Flavoring
-0.71
vil
-0.70
cano
-0.70
ARP
-0.70
POSITIVE LOGITS
facing
0.74
ed
0.74
row
0.71
teeth
0.70
eering
0.67
runners
0.66
ocrin
0.66
office
0.66
sburg
0.65
yard
0.65
Activations Density 0.010%