INDEX
Explanations
the word "Roll" or variations of it
mentions of the word "Roll," indicating a focus on specific events or topics related to that term
Negative Logits
セ  -0.92
チ  -0.87
ミ  -0.82
ヘラ  -0.77
ヤ  -0.74
ILE  -0.72
occup  -0.69
avez  -0.69
龍喚士  -0.69
BILITIES  -0.67
Positive Logits
Roll  1.06
ers  0.92
er  0.91
Roll  0.90
enberg  0.87
breaker  0.80
ogie  0.79
ife  0.79
endale  0.78
erness  0.75
Activations Density: 0.011%