INDEX
Explanations
positive affirmations or fortunate events
expressions of relief or fortunate circumstances
New Auto-Interp
Negative Logits
chairs
-0.70
design
-0.69
kindred
-0.68
appro
-0.68
newsletters
-0.67
vice
-0.65
abo
-0.64
laundry
-0.63
dq
-0.63
raised
-0.62
POSITIVE LOGITS
fortunately
0.89
Fortunately
0.84
Thankfully
0.83
thankfully
0.77
luckily
0.76
Luckily
0.76
ESA
0.75
wcs
0.74
Thankfully
0.73
mosqu
0.72
Activations Density 0.007%