INDEX
Explanations
phrases that indicate a specific time period or duration
phrases that indicate a time frame or context
New Auto-Interp
Negative Logits
Genie
-0.66
mer
-0.66
fault
-0.64
bal
-0.63
di
-0.63
por
-0.62
devil
-0.61
FRE
-0.60
perpetually
-0.60
duct
-0.59
POSITIVE LOGITS
¥ŀ
0.98
avorite
0.96
ieth
0.95
ategory
0.94
ruciating
0.93
Within
0.91
uesday
0.89
htaking
0.88
Seconds
0.86
ĵĺ
0.85
Activations Density 0.008%