I'm a second-year computer science PhD student at North Carolina State University, working with Dr. Tim Menzies on research projects in data mining and DevOps.
This page describes my current and past research projects in more detail, lists some of the classes I have taken or am taking, and collects my publications, awards and references. For more information, refer to my resume or contact me at gferrei [at] ncsu [dot] edu.
My research interests lie within software science, including but not limited to data mining, data analytics, machine learning and DevOps.
Current Research Projects
- Under construction
Advisor: Tim Menzies
Past Research Projects
- Prediction of next-step in visual tasks
Advisor: Rada Chirkova
Prediction of next-step in visual tasks
- Extracting data records
Advisor: Luciano Barbosa
Automatic extraction of web records.
- Time-Relaxed Discovery of Flock Patterns
Advisor: Marcos Vieira
In this project I worked on discovering a specific type of group behavior pattern called a flock, which can be roughly defined as a group of trajectories that stay "together" for the duration of a given time interval. My main contribution was relaxing the time constraint in order to discover additional flocks in which some objects "escape" the group for a few time instants but eventually come back, and should therefore still be regarded as a flock. I changed the problem definition to cover this broader version and adapted the existing algorithms to reflect the change.
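The idea of a time-relaxed flock can be illustrated with a minimal sketch. It is not the algorithm developed in the project: it assumes a fixed candidate group, treats "together" as every object lying within a radius of the group's centroid (a simplification), and uses hypothetical names and parameters (`together`, `relaxed_flock_intervals`, `radius`, `min_duration`, `max_gap`) chosen only for illustration.

```python
from math import hypot


def together(points, radius):
    """Return True if every point lies within `radius` of the group's centroid.

    Assumption: this centroid test stands in for whatever closeness
    criterion a real flock definition would use.
    """
    cx = sum(x for x, _ in points) / len(points)
    cy = sum(y for _, y in points) / len(points)
    return all(hypot(x - cx, y - cy) <= radius for x, y in points)


def relaxed_flock_intervals(trajectories, radius, min_duration, max_gap):
    """Find time intervals during which a fixed group behaves as a flock.

    trajectories: dict mapping object id -> list of (x, y), one per time step.
    max_gap: longest run of consecutive "apart" steps tolerated inside a flock
             (max_gap=0 gives the strict, non-relaxed behaviour).
    Returns a list of (start, end) time-step indices, both inclusive.
    """
    steps = min(len(t) for t in trajectories.values())
    flags = [together([t[i] for t in trajectories.values()], radius)
             for i in range(steps)]

    intervals = []
    start = last_ok = None
    for i, ok in enumerate(flags):
        if ok:
            if start is None:
                start = i
            last_ok = i
        elif start is not None and i - last_ok > max_gap:
            # The group stayed apart for too long: close the current interval.
            if last_ok - start + 1 >= min_duration:
                intervals.append((start, last_ok))
            start = last_ok = None
    # Close an interval that is still open at the end of the data.
    if start is not None and last_ok - start + 1 >= min_duration:
        intervals.append((start, last_ok))
    return intervals


# Three objects move side by side; object "c" leaves the group at step 2.
demo = {
    "a": [(t, 0.0) for t in range(6)],
    "b": [(t, 1.0) for t in range(6)],
    "c": [(t, 10.0 if t == 2 else 0.5) for t in range(6)],
}
print(relaxed_flock_intervals(demo, radius=1.0, min_duration=2, max_gap=0))
# [(0, 1), (3, 5)]
print(relaxed_flock_intervals(demo, radius=1.0, min_duration=2, max_gap=1))
# [(0, 5)]
```

With no gap allowed, the single step where one object wanders off splits the group into two shorter flocks. Tolerating a gap of one step reports a single flock covering the whole interval, which is the behaviour the relaxed definition is meant to capture.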
- Citizen Sensing: Spatio-Temporal Integration with JPA
Advisors: Marcos Vieira, Diego Gallo
The objective of this project was to provide spatio-temporal capabilities to a platform called Citizen Sensing. The platform focuses on crowdsourcing as a way of collecting general and geospatial data about the problems of a city. A model application for the platform is AccessibleWay, which provides a means for citizens to report problems with a city's infrastructure, such as broken sidewalks and missing stop signs. My contribution was to find ways of handling spatial and temporal data with JPA (Java Persistence API), and more specifically Hibernate. I also migrated the system from its previous persistence method to JPA and added the spatio-temporal capabilities to it. This was a joint project between the Rio and São Paulo labs of IBM Research - Brazil.
- Data Mining Techniques for Metaheuristics
Advisors: Alexandre Plastino, Isabel Rosseti
For this project, we used data mining techniques to improve the performance of metaheuristics (heuristics that seek approximate solutions to computationally hard problems) on combinatorial optimization problems. We incorporated data mining into the metaheuristics to extract patterns that characterize good-quality solutions for a particular optimization problem, and then used these patterns to guide the search toward better solutions in less computational time.
- NoSQL DBMS's for Provenance in Scientific Workflows
Advisor: Daniel de Oliveira
As part of my undergraduate senior thesis, I performed a comparative study of relational and NoSQL DBMS's for the management of provenance data in scientific workflows. I also integrated a NoSQL DBMS into a workflow management system and tested it using a real bioinformatics workflow with data about gene codes. This project resulted in my 50+ page undergraduate thesis and a published paper.
- Open Data and Cloud Computing e-Infrastructure for Biodiversity
Advisors: Vinod Rebello, Cristina Boeres
Performed research on cloud and distributed HPC technologies for an international project that deployed an e-Infrastructure of open access resources supporting the needs of the biodiversity scientific community.
Publications
- The Use of NoSQL DBMS's for the Management of Distributed Provenance in Scientific Workflows, Guilherme Ferreira, Carlos Filipe Jr., and Daniel de Oliveira. Proceedings of the 29th Brazilian Database Symposium (Acceptance rate: 32%). [Paper: pdf (Portuguese)]
- Extracting Records and Posts from Forum Pages with Limited Supervision, Luciano Barbosa and Guilherme Ferreira. Proceedings of the 15th International Conference on Web Information Systems Engineering (Acceptance rate: 25%).
Awards and Honors
- Science Without Borders Scholarship, 2012
Competed with over 7000 top Brazilian undergraduate students for one of the 500 scholarships available at the time. The scholarship fully funded one year of study at a top university in the US.
- Computer Science Department, NCSU Assistantship, 2015-2018
Recipient of a 4-year assistantship with guaranteed full funding to study and perform research at North Carolina State University.
NC State courses
For my first semester at NC State, I'm taking two courses:
- Database Management Concepts and Systems.
- Numerical Analysis.
I'm also a TA for 2 classes, Data Sciences and Multimedia Technology.
Fluminense Federal University courses
During my undergrad I took 50 different classes, consisting of:
- 23 high-level CS classes at Fluminense Federal University: Analysis and Design of Algorithms, Compilers, Software Engineering I, Software Engineering II, Database Systems, Graphic Computing, Distributed Systems, Computer Networks I, Computer Networks II, Data Structures II, Graphs, Advanced Programming Techniques, Scientific Programming, Programming Languages Theory, Computer Architecture, Operating Systems, Human-Computer Interaction, Introduction to Cryptography, Artificial Intelligence, Computer Science Theory, Computer-Based Data Management, Foundations of Mobile Systems and Applications, Information Security.
- 9 General CS classes: Foundations of Computer Architecture, Computers and Society, Introduction to Computer Science, Programming I, Programming II, Digital Circuits, Digital Circuits Lab, Data Structures I, Logic and Computer-Based Business Systems.
- 2 Graduate-level classes: Data Mining and Distributed Database Systems.
- 14 Math and Physics classes: Calculus I, Calculus II, Calculus III, Differential Calculus, Physics I, Experimental Physics I, Physics II, Experimental Physics II, Linear Algebra, Discrete Math, Statistics, Analytical Geometry, Numerical Methods.
- 2 Business classes: Intellectual Property and Organization Behaviour.
During my undergrad I was also a TA for 2 classes, Web Development and Database Systems.
References
- Tim Menzies, Assistant Professor, North Carolina State University
- Don Cronin, Program Director in DevOps Analytics, IBM
- Luciano Barbosa, Research Staff Member, IBM-Research
- Daniel de Oliveira, Professor, Fluminense Federal University