CosAE: Learnable Fourier Series for Image Restoration

(sifeiliu.net)

47 points | by E-Reverance 11 hours ago

6 comments

  • maxbond 7 hours ago
    I've been dabbling in using Fourier analysis in deep learning lately, and I'm surprised it that I haven't turned up very much research in this area (Fourier Neural Operators being what seems to be the biggest exception). Fourier analysis is such a ubiquitous tool, intuitively I'd think it would work great for deep learning. My suspicion has been that complex numbers are difficult to work with, and maybe I'm just bad at surfacing the relevant research, but I'd be interested to hear from those better informed. (My naive approach has been to simply concatenate the real and complex components together into an n+1 dimensional tensor, but surely there's a way that better respects the structure of complex numbers.)
    • Scene_Cast2 5 hours ago
      RoPE is somewhat related, I think, and it's pretty popular.

      There's also 2D rope for ViT, but I don't know how it works exactly.

  • gitroom 5 hours ago
    Been messing with this stuff too so I get the struggle. Cool results but man, waiting on code drops always drives me nuts.
  • sorenjan 11 hours ago
    These results look incredible, and with an inference time of only 36 ms for a 4X super resolution on a V100.
  • nullc 4 hours ago
    Might be useful to use gabor filters as the basis function, since just 2d cosine filters doesn't produce particularly sparse output for angled features. The additional overcompleteness would probably be helpful for the NN learning.
    • EMIRELADERO 4 hours ago
      A fun little bit of trivia: Mammalian brains implement Gabor filters in the primary visual cortex (V1), as the first step of the visual processing pipeline.
  • dingdingdang 10 hours ago
    No code has been released though?
    • sorenjan 10 hours ago
      That's addressed in the paper:

        Open access to data and code
        Question: Does the paper provide open access to the data and code, with sufficient instruc-
        tions to faithfully reproduce the main experimental results, as described in supplemental
        material?
        Answer: [No]
        Justification: Although we have answered “No” for now, we intend to release the code and
        models to enable the reproducibility of our main experimental results, pending approval
        from the legal department. This temporary status reflects our commitment to open access
        once all necessary permissions are secured.
      • GaggiX 10 hours ago
        The paper was released a few months ago for context.
  • doctorpangloss 6 hours ago
    Autoencoders are catching up. Next: luminosity separated from color and UCS.