INDEX
Explanations
references to lunch and related terms
New Auto-Interp
Negative Logits
Nights
-0.21
evenings
-0.18
evening
-0.17
dawn
-0.17
nights
-0.17
breakfast
-0.16
.za
-0.16
night
-0.15
rena
-0.15
ensus
-0.15
POSITIVE LOGITS
time
0.33
room
0.24
ãĥ§
0.23
-time
0.22
times
0.19
box
0.19
-hour
0.18
ette
0.18
iez
0.18
ing
0.18
Activations Density 0.013%