Explanations
adjectives indicating either physical characteristics or emotional states
Negative Logits
succession   -0.71
thrott       -0.68
inav         -0.68
asser        -0.66
shuttle      -0.64
iking        -0.64
house        -0.63
pigeon       -0.62
estranged    -0.61
ーティ       -0.59
Positive Logits
Anyway             1.41
U+FE0F             1.20
Anyway             1.19
;;                 1.14
endif              1.09
Congratulations    0.94
=-=-=-=-=-=-=-=-   0.93
Alright            0.93
=================================================================   0.93
;)                 0.92
Activations Density: 0.134%