Question for bird-and-animal training #68

@trustme4321

Hello, I'm interested in the CogView implementation.
I have a problem after running scripts/pretrain_single_node.sh.
After 20,000 training iterations I ran ./scripts/text2image.sh, but the resulting image shows no recognizable bird or animal shape. The input text was 飞鹰 (Flying eagle).
image

For training, I ran the shell script and changed only one variable, NUM_GPUS_PER_WORKER, from 8 to 1, since I have only one GPU instance.
I'm using Google Colab (with A100 support).

  1. NUM_GPUS_PER_WORKER=1
  2. Image tokenizer number of tokens: 8192
  3. Number of layers: 12
  4. Hidden layer size: 1024
  5. Number of attention heads: 16
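One thing the single-GPU change may affect: in data-parallel training, the number of samples consumed per iteration scales with the number of GPUs, so 20,000 iterations on one GPU would cover roughly one eighth of the samples that the same iteration count covers on eight, provided the per-GPU batch size is unchanged. The sketch below only illustrates that arithmetic. The per-GPU batch size of 4 is a hypothetical placeholder, not a value taken from pretrain_single_node.sh, and `samples_seen` is an illustrative helper, not part of CogView.

```python
def samples_seen(iterations: int, num_gpus: int, batch_per_gpu: int,
                 grad_accum_steps: int = 1) -> int:
    """Total training samples consumed under plain data parallelism."""
    return iterations * num_gpus * batch_per_gpu * grad_accum_steps


ITERATIONS = 20_000   # from the issue text
BATCH_PER_GPU = 4     # hypothetical: check the value actually used by the script

eight_gpu = samples_seen(ITERATIONS, num_gpus=8, batch_per_gpu=BATCH_PER_GPU)
one_gpu = samples_seen(ITERATIONS, num_gpus=1, batch_per_gpu=BATCH_PER_GPU)

print(f"8 GPUs: {eight_gpu:,} samples")
print(f"1 GPU : {one_gpu:,} samples")
print(f"ratio : {eight_gpu / one_gpu:.0f}x")
```

If this applies, raising the iteration count or using gradient accumulation are two common ways to bring the sample count closer to the multi-GPU setting; whether that alone explains the output quality is not something this arithmetic can confirm.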

The training output is mp_rank_00_model_state.pt, and this PyTorch checkpoint file is only 2.76 GB.
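As a rough sanity check on the file size, the parameter count implied by the settings above can be estimated with the usual approximation of about 12 × hidden² weights per transformer layer (4 × hidden² for attention plus 8 × hidden² for an MLP with 4x expansion), plus the token embeddings. The sketch below makes several assumptions that are not stated in the issue: a text vocabulary of 50,000 tokens, a 4x MLP expansion, and a checkpoint that stores fp16 weights together with fp32 master weights and two Adam moment buffers (14 bytes per parameter). Biases, layer norms, and position embeddings are ignored.

```python
def transformer_params(num_layers: int, hidden_size: int, vocab_size: int) -> int:
    """Approximate weight count: 12*h^2 per layer plus token embeddings."""
    per_layer = 12 * hidden_size ** 2      # 4h^2 attention + 8h^2 MLP (assumed 4x expansion)
    embeddings = vocab_size * hidden_size  # one row per token
    return num_layers * per_layer + embeddings


IMAGE_TOKENS = 8192    # from the issue text
TEXT_TOKENS = 50_000   # hypothetical: the real text vocabulary size may differ
BYTES_PER_PARAM = 14   # assumption: fp16 weights + fp32 master copy + Adam m and v

params = transformer_params(num_layers=12, hidden_size=1024,
                            vocab_size=IMAGE_TOKENS + TEXT_TOKENS)
size_gb = params * BYTES_PER_PARAM / 1e9

print(f"estimated parameters: {params / 1e6:.0f}M")
print(f"estimated checkpoint size: {size_gb:.2f} GB")
```

Under these assumptions the estimate lands in the low single-digit gigabyte range, so a file of a few gigabytes is plausible for a 12-layer, 1024-wide model and does not by itself indicate a truncated or broken checkpoint.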

Of course, if I use the pretrained model, cogview-base.tar, the result is fine.
image

Please take a look at my question. I would appreciate any advice or comments.
