INDEX
Explanations
instances of the word "trace."
New Auto-Interp
Negative Logits
ſie
-0.71
geſ
-0.71
ſou
-0.71
]=>
-0.70
stället
-0.70
ſind
-0.68
ſein
-0.68
ſei
-0.65
beſte
-0.63
ſelves
-0.63
POSITIVE LOGITS
deliver
0.67
trace
0.66
traces
0.65
delivers
0.60
trace
0.59
Deliver
0.58
TRACE
0.57
delivering
0.56
delivered
0.54
useless
0.54
Activations Density 0.362%