INDEX
Explanations
round objects or concepts
occurrences of the word "round" and variations of it
Negative Logits
destro      -0.73
Cald        -0.68
Guilty      -0.65
acca        -0.65
mathemat    -0.65
ionage      -0.65
earch       -0.63
wcs         -0.62
ettel       -0.61
ful         -0.61
Positive Logits
table       1.21
abouts      1.20
about       1.17
trip        1.14
outs        0.98
ups         0.94
worms       0.93
robe        0.92
meal        0.87
rob         0.85
Activations Density 0.026%