Explanations
references to a specific video game or character called "Lem" or variations of it
references to a specific individual named Lem
Negative Logits
Token         Logit
Downloadha    -0.78
MODE          -0.72
gerald        -0.69
ãĤ¯           -0.69
ij士          -0.69
0000000       -0.68
ãĥĥãĥĪ        -0.68
ãĥ¼ãĥĨãĤ£     -0.67
aneers        -0.66
AQ            -0.66
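Several tokens in the Negative Logits list (for example ãĤ¯ and ãĥĥãĥĪ) look like encoding debris but are more likely raw byte-level BPE vocabulary strings. The following is a minimal sketch of how such strings could be decoded, assuming the underlying model uses the GPT-2 byte-to-unicode table; the page itself does not state which tokenizer is used. The names `bytes_to_unicode` and `decode_token` are chosen for illustration.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable-character table (assumed tokenizer)."""
    # Bytes that are already printable map to themselves.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAC + 1))
        + list(range(0xAE, 0xFF + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a byte-level vocabulary string back into readable UTF-8 text.

    Raises KeyError if the string contains a character outside the byte table.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    # Partial multi-byte sequences are shown as U+FFFD instead of failing.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ãĤ¯", "ãĥĥãĥĪ", "ãĥ¼ãĥĨãĤ£"]:
        print(repr(tok), "->", decode_token(tok))
```

Under that assumption, the three strings in the demo decode to katakana fragments (ク, ット, ーティ). A string that mixes byte symbols with an already decoded character falls outside the table and is not handled by this sketch.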
Positive Logits
Token    Logit
borgh    1.04
oine     0.98
mons     0.96
ike      0.94
ongh     0.93
Lem      0.88
ond      0.88
osal     0.87
eny      0.86
onde     0.85
Activations Density: 0.008%