{ Stuart James }

Research interests

My research activities fit broadly into Spatial Reasoning — how we can reason about the layout of objects in space in both 2D and 3D to provide insight or retrieve relevant information. My research has a keen interest on varied data types including those from the Humanities such as Art and Cultural Heritage.

Exploring using **Depth** and **Knowledge** to answer questions specifically related to the layout of a 3D scene from a 2D perspective.

Visual Question & Answering

Detection, Representation and Reasoning on simplified representations or symbols such as Sketch, Line, Hatching, Motifs or icons.

Abstract & Iconography Reasoning

Identifying and retrieving relevant knowledge held within Knowledge Graphs to support Computer Vision tasks such as Visual Question and Answering or reasoning on location.

Knowledge Retrieval & Reasoning

Reconstructing the semantic relational structure of the scene using geometry and knowledge. Providing advanced interaction for questioning and reasoning.

Scene Graph

Principally on layout of content in 2D or 3D and how to make decisions that influence about a path or option linked with Visual Question and Answering

Planning & Reasoning

We have explored using sketches to search collections of videos using Visual Storyboarding to express the sequence of events in the target clip.

Sketch based Retrieval

We are using sequences to retrieve information providing a broader context than a one-off search. We have demonstrated through Free-Hand storyboarding and storey synthesis.

Visual Narratives and Stories

Within VR we explored the use free-hand sketching in an Immersive Environment (VR) with multiple modalities for the task of retrieval.

Interaction in Virtual Reality

Providing storytelling experiences overlaying information of surrounding Cultural Heritage and the stories of the particpants in the MEMEX Project.

Interaction in Augmented Reality

Cultural Heritage & Digital Humanities

Assistive Technologies

Robotics

Research Group & Collaborators

**Research Topic:** Causality and Representation learning

Supervisor with Dr Alessio Del Bue (IIT)

Davide Talon

PhD Student

**Research Topic:** Optimising camera localisation in urban scenes

Collaborator with Dr Alessio Del Bue (IIT)

Dr Matteo Toso

PostDoc Collaborator

**Research Topic:** RePAIR Fresco 3D reconstruction and assembly

Collaborator with Dr Alessio Del Bue (IIT)

Dr Theodore Tsesmelis

PostDoc Collaborator

Allumni

Mohamed Dahy Abdelaher Elkhouly

PhD Student

Dr Matteo Taiana

PostDoc Collaborator

Àlex Solé Gómez

Research Fellow

Dr Daniele Giunchi

External Collaborator

Openings

Looking to do a PhD?

Our group is always looking for good PhD candidates, so if you are interested in doing a PhD in Visual Reasoning please contact me to discuss the options. For more details review research areas and publications especially before making an inquiry or application.

Current Funding options:

Self-funded
Chinese Scholarships Council

Call for Interest in MSCA Postdoctoral Fellowships

Open call for interest in co-writing a MSCA Postdoctoral Fellowship on Computer Vision applied to the Arts and Humanities at Durham University. Wide array of topics we can discuss, but includes everything from digitisation to understanding and reasoning about art and heriage. The MSCA is an internationa collaborative program so a long-term secondment is required.

Project Duration: 1-2 Years

The EU provides support for the recruited researcher in the form of

a living allowance
a mobility allowance
if applicable, family, long-term leave and special needs allowances

In addition, funding is provided for

research, training and networking activities
management and indirect costs

Eligibility:

PhD or 4 years of full-time research experience

Dates:

Call opens 10 April 2024
Deadline 11 September 2024

Full details at https://marie-sklodowska-curie-actions.ec.europa.eu/actions/postdoctoral-fellowships

Feel free to contact me if you have any questions, please use "MSCA Postdoctoral Fellowships"" in the subject line.

Latest Blog Post

As of 1st September 2023, I will be taking up a position as Assistant Professor in Visual Computing at Durham University working in the VIViD group. This marks a major transition for me, as I move from being a contract-based Assistant Professor (or Researcher RTDa in the Italian system) to a permanent member of staff (i.e. Lecturer).

newsfeed

September 2024

Organising VISART VII: Vision for Art

July 2023

Co-Chair for the BMVA Computer Vision Summer School

September 2023

Joined Durham University as Assistant Professor (Lecturer)

October 2022

Organising VISART VI: Vision for Art

August 2022

Affiliated to Interactive Technologies Institute (ITI/LARSyS)

September 2021

New EU project - RePAIR

April 2021

Program Chair at BMVC’21

August 2020

Organising VISART V: Vision for Art

December 2019

Coordinating and Implementing MEMEX EU Project

December 2019

Rejoined Visual Geometry and Modelling Lab of IIT

January 2019

Researcher (Assist. Prof) in Cultural Heritage in IIT

April 2017

Moved to Istituto Italiano di Tecnologia (IIT)

April 2017

Became Honorary Research Associate at UCL

October 2016

Joined UCL Centre for Digital Humanities

April 2016

Invited to Rank Prize Symposium on Computer Vision and Video Effects

April 2016

Graduated as Doctor of Philosophy

October 2015

Research Associate at UCL with Prof. Tim Weyrich

September 2015

Defended PhD Thesis

Latest Publication

We introduce IFFNeRF to estimate the six degrees-of-freedom (6DoF) camera pose of a given image, building on the Neural Radiance Fields (NeRF) formulation. IFFNeRF is specifically designed to operate in real-time and eliminates the need for an initial pose guess that is proximate to the sought solution. IFFNeRF utilizes the Metropolis-Hasting algorithm to sample surface points from within the NeRF model. From these sampled points, we cast rays and deduce the color for each ray through pixel-level view synthesis. The camera pose can then be estimated as the solution to a Least Squares problem by selecting correspondences between the query image and the resulting bundle. We facilitate this process through a learned attention mechanism, bridging the query image embedding with the embedding of parameterized rays, thereby matching rays pertinent to the image. Through synthetic and real evaluation settings, we show that our method can improve the angular and translation error accuracy by 80.1% and 67.3%, respectively, compared to iNeRF while performing at 34fps on consumer hardware and not requiring the initial pose guess.

Accepted at International Conference on Robotics and Automation (ICRA) in Yokohama, Japan.

Contact

To find out more about our research you can find me at...

Research Scientist

Developer

Blogger

Runner

Climber

Drummer

Research interests

Visual Question & Answering

Abstract & Iconography Reasoning

Knowledge Retrieval & Reasoning

Scene Graph

Planning & Reasoning

Sketch based Retrieval

Visual Narratives and Stories

Interaction in Virtual Reality

Interaction in Augmented Reality

Cultural Heritage & Digital Humanities

Assistive Technologies

Robotics

Research Group & Collaborators

Davide Talon

Dr Matteo Toso

Dr Theodore Tsesmelis

Allumni

Mohamed Dahy Abdelaher Elkhouly

Dr Matteo Taiana

Àlex Solé Gómez

Dr Daniele Giunchi

Openings

Looking to do a PhD?

Call for Interest in MSCA Postdoctoral Fellowships

Latest Blog Post

18 Jul 2023 . research . New position at Durham University Comments

Read More

Previous posts

newsfeed

September 2024

July 2023

September 2023

October 2022

August 2022

September 2021

April 2021

August 2020

December 2019

December 2019

January 2019

April 2017

April 2017

October 2016

April 2016

April 2016

October 2015

September 2015

Latest Publication

2024 IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model

See full publication index

Contact

To find out more about our research you can find me at...

Department of Computer Science

Durham University

Room MS2099, Mathematical Sciences and Computer Science Building, Durham University, Upper Mountjoy, Stockton Road, DURHAM, DH1 3LE