|
Multi-view 3D People Reconstruction combining Parametric and Non-parametric models
Òscar Lorente,
Francesc Moreno-Noguer,
Enric Corona
dissertation
In the context of 3D human reconstruction, dissertation on the contribution of parametric models (SMPL) to the Implicit Differentiable Renderer (IDR), an architecture that implicitly represents the geometry as a zero level-set of a neural network, and uses differentiable rendering to train with weak 2D supervision.
|
|
Video Surveillance for Road Traffic Monitoring
Pol Albacar,
Òscar Lorente,
Eduard Mainou,
Ian Riera
arXiv, 2021
arXiv /
code
Solution to the third track of the AI-City Challenge, that aims to track vehicles across multiple cameras placed in multiple intersections spread out over a city. The methodology followed focuses first in solving multi-tracking in a single camera and then extending it to multiple cameras using siamese networks and metric learning.
|
|
3D Reconstruction of Urban Scenes
Josep Brugués,
Òscar Lorente,
Ian Riera,
Sergi García
2021
code /
slides
3D reconstruction of buildings from a set of images taken from different points of view (frontal images of the façades and aerial images). Rectify the perspective distortion from a single view, estimate essential and fundamental matrix, calibrate a camera with a planar pattern, estimate the depth of points in the scene given two images, generate new views of the scene, and compute a 3D model either from a set of calibrated or uncalibrated cameras (SfM).
|
|
Scene Understanding for Autonomous Driving
Òscar Lorente,
Ian Riera,
Aditya Rana
arXiv, 2021
arXiv /
code
Study of the behaviour of different configurations of RetinaNet, Faster R-CNN and Mask R-CNN (Detectron2) by a qualitative and quantitative evaluation on KITTI-MOTS, MOTSChallenge and out of context datasets.
|
|
Image Classification with Classic and Deep Learning Techniques
Òscar Lorente,
Ian Riera,
Aditya Rana
arXiv, 2021
arXiv /
code
Image classifier using both classic computer vision techniques (Bag of Visual Words classifier using SVM) and deep learning techniques (MLPs, InceptionV3 and our own CNN: TinyNet).
|
|
Museum Painting Retrieval
Òscar Lorente,
Ian Riera,
Shauryadeep Chaudhuri,
Oriol Catalan,
Víctor Casales
arXiv, 2021
arXiv /
code
Query by example CBIR system for finding paintings in a museum image collection using color, texture, text and feature descriptors in datasets with different perturbations in the images: noise, overlapping text boxes, color corruption and rotation.
|
|
Pedestrian Detection in 3D Point Clouds using Deep Neural Networks
Òscar Lorente,
Josep R. Casas,
Santiago Royo,
Ivan Caminal
arXiv, 2020
arXiv /
slides
PointNet++ based architecture to classify pedestrians in LIDAR point clouds using 3D clusters obtained by projecting 2D labels.
|
|
Image Restoration and Segmentation with Optimization Techniques
Òscar Lorente,
Aditya Rana,
Antoni Rodriguez
2020
code /
slides
Implement different optimization techniques to solve specific tasks: inpainting, Poisson editing, Chan-Vese segmentation and Markov Random Fields for image segmentation.
|
|