INDEX
Explanations
phrases related to legal agreements and commitments
New Auto-Interp
Negative Logits
spotting
-0.76
Genie
-0.70
brainstorm
-0.67
Revival
-0.66
Chimera
-0.65
COMPLE
-0.65
ovember
-0.64
ilet
-0.64
Makoto
-0.64
history
-0.64
POSITIVE LOGITS
belonged
1.00
belongs
0.95
belong
0.93
accustomed
0.90
allegiance
0.90
subscrib
0.88
subscribe
0.88
entitle
0.87
subscribed
0.85
attached
0.85
Activations Density 0.073%