Visual Detective Assignment Part 5

Visual Detective R Assignment

This assignment attempts to solve the 2021 IEEE Visual Analytics Science and Technology (VAST) Challenge: Mini-Challenge 2 by applying different visual analytics concepts, methods, and techniques with relevant R data visualisation and data analysis packages.

Archie Dolit https://www.linkedin.com/in/adolit/ (School of Computing and Information Systems, Singapore Management University)
07-25-2021

4. Proposed Solutions

Q5: Evidence of Suspicious Activities

Do you see evidence of suspicious activity? Identify 1- 10 locations where you believe the suspicious activity is occurring, and why.

Building on the POI network with employee interactions from Question 4, covert the gps coordinates to simple feature and plot it in the tourist map.

gps_poi_network_sf <- st_as_sf(gps_poi_network,
                               coords = c("long", "lat"),
                               crs= 4326)
gps_poi_network_sf
Simple feature collection with 773 features and 13 fields
Geometry type: POINT
Dimension:     XY
Bounding box:  xmin: 24.85088 ymin: 36.04802 xmax: 24.90814 ymax: 36.08962
Geodetic CRS:  WGS 84
# A tibble: 773 x 14
   timestamp           CarID date                day    hour Deparment
 * <dttm>              <fct> <dttm>              <ord> <int> <chr>    
 1 2014-01-06 07:34:01 10    2014-01-06 00:00:00 Mon       7 Executive
 2 2014-01-06 07:44:01 12    2014-01-06 00:00:00 Mon       7 Security 
 3 2014-01-06 07:59:01 8     2014-01-06 00:00:00 Mon       7 Informat~
 4 2014-01-06 08:03:01 13    2014-01-06 00:00:00 Mon       8 Security 
 5 2014-01-06 08:14:01 30    2014-01-06 00:00:00 Mon       8 Security 
 6 2014-01-06 08:17:01 22    2014-01-06 00:00:00 Mon       8 Security 
 7 2014-01-06 11:46:01 16    2014-01-06 00:00:00 Mon      11 Security 
 8 2014-01-06 11:46:01 107   2014-01-06 00:00:00 Mon      11 <NA>     
 9 2014-01-06 11:47:01 33    2014-01-06 00:00:00 Mon      11 Engineer~
10 2014-01-06 11:52:01 10    2014-01-06 00:00:00 Mon      11 Executive
# ... with 763 more rows, and 8 more variables: Title <chr>,
#   FullName <chr>, poi_diff <drtn>, poi <lgl>, lat_diff <dbl>,
#   long_diff <dbl>, close_contact <lgl>, geometry <POINT [°]>
gps_poi_network_points <- gps_poi_network_sf %>%
  select(timestamp,
         CarID,
         Deparment,
         Title,
         FullName,
         date,
         hour)

tmap_mode("view")
tm_shape(bgmap) +
  tm_rgb(bgmap, r = 1,g = 2,b = 3,
       alpha = NA,
       saturation = 1,
       interpolate = TRUE,
       max.value = 255) +
  tm_shape(gps_poi_network_points) +
  tm_dots(col = 'red', border.col = 'black', size = 1, alpha = 0.5, jitter = .8) +
  tm_facets(by = "date", ncol = 1)

1. Frydos Autosupply n More

This is the suspicious place because of the 10,000 spent on 2014-01-13. Additionally, members of the Security department frequently visit this place:

2. Spetsons Park

On January 07, 2014, 3:25, Isia Vann visited this place which is very unusual especially in the wee hours of the morning.

3. CEO’s house

On January 10, 2014, 23:23, Axel Calzas visited the place where the CEO is residing, he was followed by Kanon Herrero at arond 23:33 After a few hours, Felix Balas can also be seen around the vicinity On January 11, 2014, 00:25.

Photo Evidences|

4. Chostus Hotel

On January 08, 2014, around 13:00, both Brand Tempestad and Elsa Orilla were around the vicinity of hotel. This is unusual because it is still office hours on a weekday and they were in a hotel.

5. Warehouse near Sannan Park

On January 10, 2014, 22:20, Minke Mies visited this place. He also frequently visits the location around the Frydos Autosupply n More.

5. Conclusion

This assignment attempts to solve the 2021 VAST Challenge: Mini-Challenge 2 by applying different visual analytics concepts, methods, and techniques.

The interactive bar chart was used to identity the most popular locations which is Katerina’s Cafe while interactive heatmap was used to determine the day and time when GAStech employees visit the place. The interactive boxplot was used to perform initial analysis of outliers while plot_anomaly_diagnostics function was used to diagnose unusual purchases particularly the 10,000 transaction in Frydos Autosupply n More.

Interactive heatmap was also used to assess the anomalies where it shows the transactions with the missing credit card and loyalty card data. Adding the gps and car data and plotting the movement path using tmap, 4 employees were identified who may be involved in the suspicious transactions in Frydos Autosupply n More.

An approach was proposed to determine the owners of the loyalty and credit card data. It involves mapping the credit card transaction purchases timestamp against the interactive ‘Point of Interest’ map.

Similar to POI, relationship among the GASTech personnel was establish based on their ‘close contact’ with each other where they are meeting at the same place and at the same time. An interactive network graph and heatmap were used to show the GAStech personel relationships based on the number of their interactions.

Synthesizing the information from all the questions 1 to 4 and using interactive POI maps, several locations where identified to be the place where suspicious activities are happening.

Using relevant R data visualisation and data analysis packages, the previous submissions from 2014 VAST Challenge were enhanced by adding interactive features and making the visualisation reproducible.

Finally, this assignment can still be further improved by using RShiny App and have a more friendly user interface to perform the investigation.

6. References

Citation

For attribution, please cite this work as

Dolit (2021, July 25). Visual Analytics & Applications: Visual Detective Assignment Part 5. Retrieved from https://adolit-vaa.netlify.app/posts/2021-07-26-assignment-5/

BibTeX citation

@misc{dolit2021visual,
  author = {Dolit, Archie},
  title = {Visual Analytics & Applications: Visual Detective Assignment Part 5},
  url = {https://adolit-vaa.netlify.app/posts/2021-07-26-assignment-5/},
  year = {2021}
}