INDEX
Explanations
instances of the word 'front'
occurrences of the word "front."
New Auto-Interp
Negative Logits
IRO
-0.81
cci
-0.81
dayName
-0.77
cles
-0.77
fortune
-0.76
yssey
-0.75
ascript
-0.74
======
-0.72
cham
-0.71
ongo
-0.71
POSITIVE LOGITS
lawn
1.03
porch
1.01
runners
0.97
iers
0.90
mast
0.82
most
0.82
row
0.81
door
0.80
yard
0.77
side
0.75
Activations Density 0.018%