Performance of methods for small-sample inference in generalized linear mixed models for stepped-wedge designs with unequal cluster sizes