INDEX
Explanations
phrases or words related to showing preference or support for something
expressions of support or preference for something
New Auto-Interp
Negative Logits
colored
-1.31
--
-1.23
defense
-1.21
avored
-1.19
bors
-1.08
canceled
-1.07
traveled
-1.06
honor
-1.03
honoring
-1.02
labeled
-1.02
POSITIVE LOGITS
colour
2.21
recognise
2.20
realise
2.20
colour
2.15
colours
2.15
realised
2.13
organise
2.11
flavour
2.09
humour
2.08
practise
2.08
Activations Density 0.091%