Explanations
references to the word "ant" or words that contain "ant" within larger words
occurrences of the word "ant" and its variants
Negative Logits
Token       Logit
UTION       -0.87
ESSION      -0.81
=-=-=-=-    -0.80
PASS        -0.78
Tycoon      -0.77
Done        -0.76
Gorge       -0.75
OURCE       -0.75
Divide      -0.75
UGC         -0.74
Positive Logits
Token       Logit
ant         0.88
iques       0.82
gard        0.81
elope       0.81
uve         0.80
oine        0.78
antit       0.78
ibl         0.77
icy         0.76
icycle      0.75
Activations Density: 0.013%