INDEX
Explanations
references to people or entities in brackets accompanied by some context
occurrences of brackets indicating quoted or referenced thoughts
New Auto-Interp
Negative Logits
ĪĴ
-0.81
ĻĤ
-0.80
ĸļ
-0.79
ãĤ¶
-0.73
ãĥł
-0.71
aru
-0.70
Ń·
-0.70
-+-+
-0.69
İĭ
-0.69
zh
-0.69
POSITIVE LOGITS
selves
0.76
},"
0.68
."[
0.62
Management
0.61
Mol
0.61
waivers
0.60
Manifest
0.60
,"
0.57
waiver
0.56
cape
0.55
Activations Density 0.077%