INDEX
Explanations
the word "Rat" followed by a number
mentions of the word "Rat."
New Auto-Interp
Negative Logits
cu
-0.80
seaf
-0.79
CVE
-0.77
eclipse
-0.72
ssl
-0.66
ã
-0.65
faked
-0.63
fecture
-0.63
flaw
-0.63
autop
-0.62
POSITIVE LOGITS
Rat
4.11
Rat
2.60
Rats
2.13
rat
1.76
rat
1.74
Ratt
1.52
Rath
1.34
rats
1.28
rats
1.24
Cod
1.15
Activations Density 0.023%