13 lines
606 B
Markdown
13 lines
606 B
Markdown
![[Pasted image 20240130103300.png]]
|
|
|
|
|
|
> btw, i suspect that the coordconv at the first layer might not be necessary (coord0)
|
|
> at the first layer the CNN just does low level feature dedection (detecting edges and so on)
|
|
> the coordconv are only helpful for higher level analysis deeper in the model
|
|
> at least that's my theory
|
|
> also, do i see correctly that after the first layer, there's 1 maxpool that makes it 4x smaller? from 2048x1024 to 512x256?
|
|
> or does the first layer use stride 2, and the maxpool only makes it 2x smaller?
|
|
> overall the architecture looks good :+1:
|
|
> - Comments [[Floris]]
|
|
|