INDEX
Explanations
phrases indicating regularity or frequency, particularly involving the word "every" followed by a number
Negative Logits
phabet       -0.79
çīĪ          -0.68
Viz          -0.68
anonymity    -0.67
dict         -0.66
ez           -0.64
Sakuya       -0.64
devils       -0.62
labels       -0.61
ãĥ¯          -0.59
Positive Logits
conceivable  0.93
THING        0.92
single       0.85
inch         0.82
imaginable   0.81
where        0.75
hour         0.75
ursday       0.75
month        0.74
single       0.74
Activations Density 0.019%