INDEX
Explanations
references to cultural traditions and practices related to identity and heritage
New Auto-Interp
Negative Logits
Persona
-0.13
ÃĹ↵↵
-0.13
/GPL
-0.12
天åłĤ
-0.12
headline
-0.12
à¤ľà¤¨à¤¤
-0.12
γκα
-0.11
dirig
-0.11
Ấ
-0.11
.intellij
-0.11
POSITIVE LOGITS
passed
0.35
traditions
0.34
handed
0.31
preserved
0.30
Passed
0.29
tradition
0.28
passed
0.28
ä¼ł
0.28
practiced
0.27
centuries
0.26
Activations Density 0.212%