INDEX
Explanations
references to official organizations and their activities
New Auto-Interp
Negative Logits
bicchiere
-0.67
frattempo
-0.63
cervello
-0.63
paesaggio
-0.59
benessere
-0.58
quartiere
-0.57
tramonto
-0.56
linguaggio
-0.56
discorso
-0.56
ritratto
-0.54
POSITIVE LOGITS
Caro
0.68
Caro
0.68
Mero
0.66
Milo
0.65
Duro
0.62
Vero
0.61
Waldo
0.60
Maro
0.60
Viro
0.60
Leto
0.59
Activations Density 1.301%