|
| 1 | +============================================================================================================== |
| 2 | +Layer (type:depth-idx) Output Shape Param # |
| 3 | +============================================================================================================== |
| 4 | +T5ForConditionalGeneration [2, 100, 512] -- |
| 5 | +├─T5Stack: 1-1 [2, 100, 512] 35,332,800 |
| 6 | +├─T5Stack: 1-2 -- (recursive) |
| 7 | +│ └─Embedding: 2-1 [2, 100, 512] 16,449,536 |
| 8 | +├─T5Stack: 1-3 -- (recursive) |
| 9 | +│ └─Dropout: 2-2 [2, 100, 512] -- |
| 10 | +│ └─ModuleList: 2-3 -- -- |
| 11 | +│ │ └─T5Block: 3-1 [2, 100, 512] 2,360,512 |
| 12 | +│ │ └─T5Block: 3-2 [2, 100, 512] 2,360,320 |
| 13 | +│ │ └─T5Block: 3-3 [2, 100, 512] 2,360,320 |
| 14 | +│ │ └─T5Block: 3-4 [2, 100, 512] 2,360,320 |
| 15 | +│ │ └─T5Block: 3-5 [2, 100, 512] 2,360,320 |
| 16 | +│ │ └─T5Block: 3-6 [2, 100, 512] 2,360,320 |
| 17 | +│ │ └─T5Block: 3-7 [2, 100, 512] 2,360,320 |
| 18 | +│ │ └─T5Block: 3-8 [2, 100, 512] 2,360,320 |
| 19 | +│ └─T5LayerNorm: 2-4 [2, 100, 512] 512 |
| 20 | +│ └─Dropout: 2-5 [2, 100, 512] -- |
| 21 | +├─T5Stack: 1-4 [2, 6, 100, 64] 16,449,536 |
| 22 | +│ └─Embedding: 2-6 [2, 100, 512] (recursive) |
| 23 | +│ └─Dropout: 2-7 [2, 100, 512] -- |
| 24 | +│ └─ModuleList: 2-8 -- -- |
| 25 | +│ │ └─T5Block: 3-9 [2, 100, 512] 3,147,456 |
| 26 | +│ │ └─T5Block: 3-10 [2, 100, 512] 3,147,264 |
| 27 | +│ │ └─T5Block: 3-11 [2, 100, 512] 3,147,264 |
| 28 | +│ │ └─T5Block: 3-12 [2, 100, 512] 3,147,264 |
| 29 | +│ │ └─T5Block: 3-13 [2, 100, 512] 3,147,264 |
| 30 | +│ │ └─T5Block: 3-14 [2, 100, 512] 3,147,264 |
| 31 | +│ │ └─T5Block: 3-15 [2, 100, 512] 3,147,264 |
| 32 | +│ │ └─T5Block: 3-16 [2, 100, 512] 3,147,264 |
| 33 | +│ └─T5LayerNorm: 2-9 [2, 100, 512] 512 |
| 34 | +│ └─Dropout: 2-10 [2, 100, 512] -- |
| 35 | +├─Linear: 1-5 [2, 100, 32128] 16,449,536 |
| 36 | +============================================================================================================== |
| 37 | +Total params: 128,743,488 |
| 38 | +Trainable params: 128,743,488 |
| 39 | +Non-trainable params: 0 |
| 40 | +Total mult-adds (M): 186.86 |
| 41 | +============================================================================================================== |
| 42 | +Input size (MB): 0.00 |
| 43 | +Forward/backward pass size (MB): 217.84 |
| 44 | +Params size (MB): 307.84 |
| 45 | +Estimated Total Size (MB): 525.69 |
| 46 | +============================================================================================================== |
0 commit comments