INDEX
Explanations
phrases related to technical instructions or descriptions
specific characters or symbols, particularly variations of the letter 'l'
New Auto-Interp
Negative Logits
Negro
-0.84
Reverend
-0.83
scandals
-0.78
imperialist
-0.77
Jama
-0.76
flourishing
-0.75
disgrace
-0.75
disgr
-0.75
outraged
-0.74
Jihad
-0.74
POSITIVE LOGITS
ï¸ı
1.42
ï¸
1.09
âĹ
1.05
§
1.00
â
0.97
£
0.97
âĸ
0.95
âĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢâĶĢ
0.95
Unique
0.94
initial
0.93
Activations Density 0.278%