Explanations
references to "Ta," indicating a focus on characters or entities with that name or prefix
Negative Logits
Token              Logit
</h1>              -0.73
Lowry              -0.73
Miller             -0.72
Klein              -0.67
Cline              -0.66
ции                -0.66
(no token shown)   -0.65
thom               -0.65
Schlegel           -0.61
Lawson             -0.61
Positive Logits
Token              Logit
Ta                 1.85
Ta                 1.79
ta                 1.63
ta                 1.57
TA                 1.52
TA                 1.46
Taft               1.39
Tape               1.25
Tape               1.23
TAP                1.20
Activations Density: 0.114%