INDEX
Negative Logits
celebr
-0.62
proud
-0.62
Khe
-0.59
analges
-0.58
inclusion
-0.58
migration
-0.58
contag
-0.58
invisible
-0.57
contagious
-0.57
belonging
-0.57
POSITIVE LOGITS
Answer
1.38
Well
1.08
³³³³
0.99
Absolutely
0.97
Yes
0.95
³³³³³³³³³³³³³³³³
0.91
Honestly
0.91
Correct
0.88
³³³³³³³³
0.88
Probably
0.87
Activations Density 0.136%