INDEX
Explanations
phrases or words that involve emphasis or importance
the "t" in contractions such as "can't" or "won't"
Negative Logits
Token           Logit
mascul          -0.62
domestically    -0.57
ADRA            -0.55
heels           -0.52
validated       -0.52
federation      -0.52
Reaper          -0.52
desk            -0.51
aside           -0.51
sparing         -0.51
Positive Logits
Token           Logit
t               3.69
ti              1.99
tan             1.95
tin             1.94
tif             1.88
tz              1.85
ta              1.82
ts              1.79
tal             1.79
tor             1.77
Activations Density: 0.161%
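
One plausible reading of the density figure, offered as an assumption rather than the dashboard's documented definition, is the fraction of sampled token positions on which the feature's activation is above zero. A minimal sketch under that assumption follows; the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical and do not come from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of positions where the feature fires.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 10,000 token positions, 16 of which are active.
toy_activations = [0.0] * 9984 + [1.5] * 16
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

Under this reading, a density of 0.161% would mean the feature fires on roughly 16 of every 10,000 tokens, which fits a feature tied to a narrow pattern such as the "t" in contractions.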