Bio

I am an AI Engineer and Researcher experienced in developing AI solutions for challenging problems. Previously, I worked at Hanwha Vision America to develop next-generation solutions for real-time image segmentation on edge devices. I have received my Ph.D. in Computer Vision and Deep Learning from The University of Texas at Arlington (UTA). During my Ph.D., I worked as a graduate researcher at the Robotic Vision Lab and the LEARN Lab at UTA where I developed novel deep learning solutions for 3D reconstruction and generation. Prior to that I have worked as a senior software engineer at TwinBit.
My research focuses on Deep learning with its applications in Artificial Intelligence, Robotics and Computer Vision. Take a look at my resume or my Google Scholar profile.

News

November, 2023

One Paper accepted at the Journal of Visual Communication and Image Representation

My paper titled "IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction" was accepted at the Journal of Visual Communication and Image Representation.
October, 2023

One Paper accepted at ICCV

My paper titled "LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction" was accepted at the IEEE/CVF International Conference on Computer Vision (ICCV).
August, 2023

Started new job

I have started working as a AI research intern at Hanwha Vision America. Here, I will be developing novel solutions for real-time detection, segmentation on edge devices.
August, 2023

Successfully defended dissertation

I have successfully defended my dissertation defense and completed the final requirement of the Ph.D process. My dissertation titled Generative and Implicit Methods for 3D Point Cloud Processing can be accessed from here .
October, 2022

One Paper accepted at IEEE ISMAR 2022

My paper titled "Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds" was accepted at the PIES-MR workshop of IEEE ISMAR 2022.
October, 2020

One Paper accepted at 3DV 2020

My paper titled "A Progressive Conditional Generative Adversarial Network for Generating Dense and Colored 3D Point Clouds" was accepted at 3DV 2020.

Experience

Aug. 2023 - Nov. 2023

AI Research Intern

Hanwha Vision America

Santa Clara, CA
Aug. 2016 - Aug. 2023

Graduate Researcher

University of Texas at Arlington

Arlington, TX
Jan. 2015 - Aug. 2016

Software Engineer

TwinBit

Dhaka, Bangladesh

Education

Ph.D.Aug. 2016 - Aug. 2023

Ph.D. in Computer Science

University of Texas at Arlington
B.Sc.2008 - 2013

Bachelor of Science in Computer Science and Engineering

Shahjala University of Science and Technology

Publications

Sort by year

IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction

Mohammad Samiul Arshad, William J. Beksi

Conference Papers The Journal of Visual Communication and Image Representation, 2023.

Abstract

Reconstruction of 3D open surfaces (e.g., non-watertight meshes) is an underexplored area of computer vision. Recent learning-based implicit techniques have removed previous barriers by enabling reconstruction in arbitrary resolutions. Yet, such approaches often rely on distinguishing between the inside and outside of a surface in order to extract a zero level set when reconstructing the target. In the case of open surfaces, this distinction often leads to artifacts such as the artificial closing of surface gaps. However, real-world data may contain intricate details defined by salient surface gaps. Implicit functions that regress an unsigned distance field have shown promise in reconstructing such open surfaces. Nonetheless, current unsigned implicit methods rely on a discretized representation of the raw data. This not only bounds the learning process to the representation’s resolution, but it also introduces outliers in the reconstruction. To enable accurate reconstruction of open surfaces without introducing outliers, we propose a learning-based implicit point-voxel model (IPVNet). IPVNet predicts the unsigned distance between a surface and a query point in 3D space by leveraging both raw point cloud data and its discretized voxel counterpart. Experiments on synthetic and real-world public datasets demonstrates that IPVNet outperforms the state of the art while producing far fewer outliers in the resulting reconstruction.

LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction

Mohammad Samiul Arshad, William J. Beksi

Conference Papers The IEEE/CVF International Conference on Computer Vision (ICCV), 2023

Abstract

Accurate reconstruction of both the geometric and topological details of a 3D object from a single 2D image embodies a fundamental challenge in computer vision. Existing explicit/implicit solutions to this problem struggle to recover self-occluded geometry and/or faithfully reconstruct topological shape structures. To resolve this dilemma, we introduce LIST, a novel neural architecture that leverages local and global image features to accurately reconstruct the geometric and topological structure of a 3D object from a single image. We utilize global 2D features to predict a coarse shape of the target object and then use it as a base for higher-resolution reconstruction. By leveraging both local 2D features from the image and 3D features from the coarse prediction, we can predict the signed distance between an arbitrary point and the target surface via an implicit predictor with great accuracy. Furthermore, our model does not require camera estimation or pixel alignment. It provides an uninfluenced reconstruction from the input-view direction. Through qualitative and quantitative analysis, we show the superiority of our model in reconstructing 3D objects from both synthetic and real-world images against the state of the art.

Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds

Mohammad Samiul Arshad, William J. Beksi

Conference Papers The IEEE International Symposium on Mixed and Augmented Reality (ISMAR) Workshops, 2022.

Abstract

Real-world 3D data may contain intricate details defined by salient surface gaps. Automated reconstruction of these open surfaces (e.g., non-watertight meshes) is a challenging problem for environment systhesis in mixed reality applications. Current learning-based implicit techniques can achieve high fidelity on closed-surface reconstruction. However, their dependence on the distinction between the inside and outside of a surface makes them incapable of reconstructing open surfaces. Recently, a new class of implicit functions have shown promise in reconstructing open surfaces by regressing an unsigned distance field. Yet, these methods rely on a discretized representation of the raw data, which loses important surface details and can lead to outliers in the reconstruction. We propose IPVNet, a learning-based implicit model that predicts the unsigned distance between a surface and a query point in 3D space by leveraging both raw point cloud data and its discretized voxel counterpart. Experiments on synthetic and real-world public datasets demonstrates that IPVNet outperforms the state of the art while producing far fewer outliers in the reconstruction.

A Progressive Conditional Generative Adversarial Network for Generating Dense and Colored 3D Point Clouds

Mohammad Samiul Arshad, William J. Beksi

Conference Papers The 8th International Conference on 3D Vision (3DV), November 2020.

Abstract

In this paper, we introduce a novel conditional generative adversarial network that creates dense 3D point clouds, with color, for assorted classes of objects in an unsupervised manner. To overcome the difficulty of capturing intricate details at high resolutions, we propose a point transformer that progressively grows the network through the use of graph convolutions. The network is composed of a leaf output layer and an initial set of branches. Every training iteration evolves a point vector into a point cloud of increasing resolution. After a fixed number of iterations, the number of branches is increased by replicating the last branch. Experimental results show that our network is capable of learning and mimicking a 3D data distribution, and produces colored point clouds with fine details at multiple resolutions.

An Automated Bengali Sign Language Recognition System Based on Fingertip Finder Algorithm

Angur M. Jarman, Mohammad Samiul Arshad, Nashid Alam, Mohammed J. Islam

Journal PapersInternational Journal of Electronics & Informatics (IJEI), Volume: 4, Issue: 1, July 2015

Abstract

This paper presents a new algorithm to identify Bengali Sign Language (BdSL) for recognizing 46 hand gestures, including 9 gestures for 11 vowels, 28 gestures for 39 consonants, and 9 gestures for 9 numerals according to the similarity of pronunciation. The image was first re-sized and then converted to a binary format to crop the region of interest by using only top-most, left-most, and right-most white pixels. The positions of the finger-tips were found by applying a fingertip finder algorithm. Eleven features were extracted from each image to train a multi-layered feed-forward neural network with a back-propagation training algorithm. The distance between the centroid of the hand region and each fingertip was calculated along with the angles between each fingertip and horizontal x-axis crossed the centroid. A database of 2300 images of Bengali signs was constructed to evaluate the effectiveness of the proposed system, where 70%, 15%, and 15% images were used for training, testing, and validating, respectively. The experimental results showed an average of 88.69% accuracy in recognizing BdSL which is very much promising compare to other existing methods.

Projects

Tic-Tac-Toe with Reinforcement Learning

Mohammad Samiul Arshad

Reinforcement Learning Game Theory

Trained two agent to play Tic-Tac-Toe using reinforcement learning.

MNIST Classification

Mohammad Samiul Arshad

Supervised Learning CNN

Trained a model to classify MNIST dataset.

Real Time Face Recognition

Mohammad Samiul Arshad

Supervised Learning CNN

Trained a model to identify and recognize human faces in real time.

Teaching

Teaching History:

Fall, 2018 - Spring, 2019

Design and Analysis of Algorithms and Data Structures

Assisted Dr. Bob Weems with grading assignments and quizzes. This course presents an overview of classic approaches to algorithm design - decomposition, dynamic programming, and greedy method, understanding of particular algorithms and data structures that have wide applicability. It also included basic algorithm analysis concepts by applying math skills to worst-case and expected time using recurrences and asymptotic notation and improved programming skills - especially data structures, recursion and graphs.
Summer, 2018

Distributed Systems

Assisted Dr. Chance Eary. This course includes multithreading, distributed systems, device drivers, object oriented operating systems, advanced file systems, parallel virtual machines, and load balancing. Examples from current popular modern systems and research operating systems are analyzed too.
Summer, 2018

Mobile Systems Engineering

Assisted Dr. Chance Eary. This course presents an overview of applications of mobile systems in health, entertainment, security, and other areas.
Fall, 2017 - Spring, 2018

Intorduction to Programming

This course introduces students to computers, to the algorithmic process and to programming using basic control and data structures.
Summer, 2017

Programming Languages

Assisted Dr. Ramez Elmasri. This course presents history of Programming Languages, overview of the scripting/mixed language: Python, functional programming languages paradigm: Haskell, overview of the logic programming language: Prolog, overview of syntax and semantics of programming languages.
Fall, 2016 - Summer, 2017

Intermediate Programming

Assisted several Professor with grading assignments and quizzes. This course presents Programming concepts beyond basic control and data structures. Emphasis is given to data structures including linked-lists and trees as well as modular design consistent with software engineering principles.

Contact

701, S Nedderman Dr

Arlington, TX 76019.
mohammadsamiul.arshad [at] mavs.uta.edu

samiularshad [at] gmail.com

Mohammad Samiul Arshad

Bio

Publications

Projects

Teaching

Contact

Bio

News

Experience

AI Research Intern

Graduate Researcher

Software Engineer

Education

Publications

Filter by type

Sort by year

IPVNet: Learning Implicit Point-Voxel Features for Open-Surface 3D Reconstruction

Abstract

LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction

Abstract

Automated Reconstruction of 3D Open Surfaces from Sparse Point Clouds

Abstract

A Progressive Conditional Generative Adversarial Network for Generating Dense and Colored 3D Point Clouds

Abstract

An Automated Bengali Sign Language Recognition System Based on Fingertip Finder Algorithm

Abstract

Projects

Tic-Tac-Toe with Reinforcement Learning

MNIST Classification

Real Time Face Recognition

Teaching

Teaching History:

Design and Analysis of Algorithms and Data Structures

Distributed Systems

Mobile Systems Engineering

Intorduction to Programming

Programming Languages

Intermediate Programming

Contact