INDEX
Negative Logits
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.46
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.45
Alternatively
0.44
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.42
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.41
Alternatively
0.41
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.41
↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.41
↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵↵
0.40
overdose
0.40
POSITIVE LOGITS
υπηρε
0.39
Cuisine
0.39
ἑ
0.39
List
0.38
hilfe
0.38
list
0.38
再來
0.38
SE
0.37
Sent
0.37
snd
0.37
Activations Density 0.004%