Explanations
words related to the action of rolling
Negative Logits
ILE       -0.74
ior       -0.72
ヤ        -0.69
acters    -0.67
orial     -0.66
IOR       -0.65
bour      -0.64
bub       -0.64
セ        -0.64
perse     -0.62
Positive Logits
downhill  0.99
boulder   0.83
hills     0.81
dice      0.78
roll      0.75
icking    0.73
sheet     0.73
oats      0.73
sheets    0.73
rolling   0.68
Activations Density 0.009%
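
The density figure can be read as the share of sampled tokens on which the feature fires at all. The sketch below assumes that definition (activation strictly above zero); the function name `activation_density` and the sample data are hypothetical illustrations, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active (activation > 0). This is an illustrative helper,
    not the dashboard's own code.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 9 of 100,000 tokens.
sample = [0.0] * 99_991 + [1.5] * 9
print(f"{activation_density(sample) * 100:.3f}%")  # prints "0.009%"
```

A density this low would mean the feature is active on roughly one token in eleven thousand, which fits a narrow feature tied to a single concept such as rolling.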