Explanations
descriptive phrases related to visual details and characteristics of people or events
Negative Logits
ujednoznacz        -0.63
lenker             -0.61
Roskov             -0.59
SuccessListener    -0.58
المعيارى           -0.55
favoritas          -0.55
gradova            -0.54
exemplu            -0.53
démocratie         -0.53
apunov             -0.53
Positive Logits
figure             1.13
figures            1.00
familiar           0.99
tall               0.94
unfamiliar         0.94
strange            0.88
figure             0.87
Figure             0.86
voice              0.84
familiar           0.84
Activations Density 0.271%