INDEX
Explanations
references to a successful history or reputation
New Auto-Interp
Negative Logits
ansa
-0.17
ahu
-0.16
tracks
-0.15
oppins
-0.15
iske
-0.15
tracks
-0.14
trackers
-0.14
нак
-0.14
ounter
-0.14
dy
-0.14
POSITIVE LOGITS
record
0.36
-record
0.28
record
0.27
Record
0.27
Record
0.22
RECORD
0.22
_record
0.21
records
0.21
.record
0.20
record
0.19
Activations Density 0.003%