Next Article in Journal
Urban Green Plastic Cover Mapping Based on VHR Remote Sensing Images and a Deep Semi-Supervised Learning Framework
Next Article in Special Issue
Linking Geosocial Sensing with the Socio-Demographic Fabric of Smart Cities
Previous Article in Journal
Urban Coastal Flood-Prone Mapping under the Combined Impact of Tidal Wave and Heavy Rainfall: A Proposal to the Existing National Standard
Previous Article in Special Issue
Geographical Structural Features of the WeChat Social Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Overview of Social Media Apps and their Potential Role in Geospatial Research

by
Innocensia Owuor
* and
Hartwig H. Hochmair
Geomatics Program, Fort Lauderdale Research and Education Center, University of Florida, 3205 College Ave, Davie, FL 33314, USA
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2020, 9(9), 526; https://0-doi-org.brum.beds.ac.uk/10.3390/ijgi9090526
Submission received: 16 July 2020 / Revised: 26 August 2020 / Accepted: 29 August 2020 / Published: 1 September 2020

Abstract

:
Social media apps provide analysts with a wide range of data to study behavioral aspects of our everyday lives and to answer societal questions. Although social media data analysis is booming, only a handful of prominent social media apps, such as Twitter, Foursquare/Swarm, Facebook, or LinkedIn are typically used for this purpose. However, there is a large selection of less known social media apps that go unnoticed in the scientific community. This paper reviews 110 social media apps and assesses their potential usability in geospatial research through providing metrics on selected characteristics. About half of the apps (57 out of 110) offer an Application Programming Interface (API) for data access, where rate limits, fee models, and type of spatial data available for download vary strongly between the different apps. To determine the current role and relevance of social media platforms that offer an API in academic research, a search for scientific papers on Google Scholar, the Association for Computing Machinery (ACM) Digital Library, and the Science Core Collection of the Web of Science (WoS) is conducted. This search revealed that Google Scholar returns the highest number of documents (Mean = 183,512) compared to ACM (Mean = 1895) and WoS (Mean = 1495), and that data and usage patterns from prominent social media apps are more frequently analyzed in research studies than those of less known apps. The WoS citation database was also used to generate lists of themes covered in academic publications that analyze the 57 social media platforms that offer an API. Results show that among these 57 platforms, for 26 apps at least some papers evolve around a geospatial discipline, such as Geography, Remote Sensing, Transportation, or Urban Planning. This analysis, therefore, connects apps with commonly used research themes, and together with tabulated API characteristics can help researchers to identify potentially suitable social media apps for their research. Word clouds generated from titles and abstracts of papers associated with the 57 platforms, grouped into seven thematic categories, show further refinement of topics addressed in the analysis of social media platforms. Considering various evaluation criteria, such as provision of geospatial data or the number (i.e., absence) of currently published research papers in connection with a social media platform, the study concludes that among the numerous social media apps available today, 17 less known apps deserve closer examination since they might be used to investigate previously underexplored research topics. It is hoped that this study can serve as a reference for the analysis of the social media landscape in the future.

1. Introduction

Numerous social media apps have evolved in the recent years and significantly enhanced information sharing and networking capabilities among their users. Undeniably, these apps have permeated many aspects of modern life in our society.
Largely aided by the proliferation of smartphones and their use today, social media apps are increasingly location-based, providing analysts with access to a wide range of shared spatial data, such as check-ins, geo-tagged images, video clips or text messages, or reviews of businesses and other localities. Based on these data, research studies provide valuable insights into spatio-temporal aspects of marketing, event detection, political campaigning, disaster management, migration, transport, natural resource management, human mobility, urban planning, tourism, epidemics, and communication [1,2,3,4,5,6].
Although hundreds of social media apps have been developed, only a handful of them are commonly used in research, including Twitter, Facebook, Flickr, Foursquare, YouTube, LinkedIn, or Yelp. The presence of user selection bias, based on age, gender, socio-economic status, etc., is a well-known phenomenon in social media use. For example, Twitter shows a bias towards male users and an under-sampling of Hispanic and African American users in certain regions of the United States [7]. User characteristics also vary between the different social media apps. For example, a survey conducted in the United States found that Snapchat and Instagram are used primarily by young people, while YouTube and Facebook are more widely used by older generations [8]. The same study showed that LinkedIn is more biased towards higher-income users (>$75,000 per year) than other apps such as WhatsApp or Snapchat. Popularity also varies per region. WeChat, for example, is popular in China [9] whereas Vkontakte is widely used in Russia [10]. In general, the country where an app has been developed draws also a strong user base for that app. Examples include Dronestagram (France) [11] or Mapillary (Sweden) [12]. Different apps vary in the topical focus and therefore attract different users. For instance, Doximity is popular with medical professionals whereas Dronestagram attracts drone pilots who like to share airborne imagery. Due to these many facets of user selection bias, analysis results drawn from a single app may be biased as well. In consequence, having access to data from a wider range of social media platforms that reach different user groups in different geographic regions might, therefore, help to reduce user selection bias and its effect on analysis results, if combined in joint data analysis.
Analyzing the prominence of different social media apps and existing access options to their data through application programming interfaces (APIs) provides an important knowledge base for researchers to potentially expand the range of social media apps used for spatio-temporal research endeavors. Using this and the need for a better understanding of the social media landscape as a motivation, this research reviews 110 social media apps and focuses on three objectives:
  • Identification of social media apps with APIs and the provision of location data through the APIs;
  • Determination of the number of research papers published between 1997 and April 2020 on Google Scholar, the Association for Computing Machinery (ACM) Digital Library, and the Web of Science (WoS) that are related to the different social media apps with APIs;
  • Extraction of research themes in academic literature covered in WoS for social media apps with APIs.
This study’s predominant aim is to bring to light less known apps that provide spatial data, which is central to geospatial research. Data extracted from these apps may offer researchers information that is unavailable from other platforms and can be potentially used to tackle new research questions and fill current research gaps. The characteristics provided about these apps in this study, e.g., API rate limits or general theme, can guide researchers towards a closer examination of specific apps that align with their research interests and that may have been previously gone unnoticed.
Whilst access to location data from social media apps has been beneficial in various ways, it has broached concerns among social media users about data protection and privacy especially in the aftermath of the Cambridge Analytica scandal [13]. Many policies that provide guidelines for safeguarding data privacy of social media users have been instituted by governments and social media companies alike in the recent past. Examples are the General Data Protection Regulation (GDPR) [14] and, more recently, the California Consumer Privacy Act (CCPA) [15]. These policies tend to raise challenges for harvesting data from social media apps, as will be discussed for selected apps in more detail.

2. Literature Review

Social media is a group of highly interactive platforms that use mobile and Web 2.0-based technologies to facilitate the creation and exchange of user-generated content [16,17]. Social media apps can be classified by purpose and function, which include social networking, microblogging, blogging, photo sharing, or crowdsourcing [18]. One of the first social media apps whose features are mirrored by today’s popular social networks, such as Facebook, was SixDegrees, which was launched in 1997 and shut down in 2001 [19]. The wide reach of social media benefits businesses [20], emergency and disaster management [21], and the tracking of epidemics and diseases [22,23]. Social scientists pioneered the use of social media data for ethnographic studies that give better insights about people’s behaviors and opinions [24,25,26]. Currently, over 9000 tweets are sent out per second and over 1000 photos are uploaded on Instagram per second [27]. This abundance of generated data is critical for real-time monitoring especially in emergency management and crisis mapping [22]. Research pertaining to social media has been largely facilitated by data access through APIs [28]. The availability of APIs in platforms such as Wikipedia, Twitter, Facebook, or Foursquare led to an increase in data services, software tools for analysis as well as social media analytics platforms [29]. API documentation provide detailed information about the API functionality and a description of data retrieval [30]. APIs allow third parties, such as games or productivity apps, to leverage friendship connections through regulated, programmatic access to recorded connections between registered users [31]. Access to public data that is published on the web may require login credentials [32] in order to be able to enforce download limits and to oversee access to data. For this purpose, social media sites have instituted authentications using keys and tokens. For example, Flickr allows up to 3600 queries per hour [33] and Twitter has varying rules for rate limiting on its APIs, such as 15 calls every 15 min [34]. Rate limits prevent abuse of data access and enhance security although this stifles generativity (i.e., third party use of the social graph to create innovative products or new insights into data [31]). API rate limiting can be implemented through request queues, throttling API calls (i.e., disconnect client or reduce bandwidth), and the implementation of rate limit algorithms [35].
Because of the Facebook Cambridge Analytica data breach, some social media apps have deprecated certain API methods or shut them down completely to protect user data and control its use. For example, Facebook recently further restricted data accessibility [36] and the 500px API is now available through paid subscription only [37].
The important role of social media for scientific research becomes evident by the sheer number of papers analyzing data from social media platforms. Various review studies and meta-analyses provide an overview of how data extracted from different social media platforms are analyzed, and how social media apps are used in different contexts and environments. For example, one paper reviews literature on the use of social media in academia [18]. It distinguishes between several categories of social media use, including social networking, social data sharing, video, blogging, microblogging, wikis, rating, and reviewing. It reports that the percentage of scholars who use social networking apps (e.g., Facebook, LinkedIn) for professional purposes is much lower than the percentage of scholars who use it for personal reasons. It also points towards a large variability in usage of different platforms among scholars, with numbers ranging between 10% for Twitter, 46% for ResearchGate, and 55% for YouTube. Another systematic review analyzes social media use for public health communication among the general public, patients, and health professionals based on 98 original research studies [38]. These studies included a range of social media tools and apps, with Facebook, blogs, Twitter, and YouTube being the most often reported tools. A review of 279 research papers describes the emerging role of social media in tourism and hospitality, which has only a short history [39]. Fewer than 10 papers were published in 2007, followed by a strong increase in the annual number of papers published. Popular keywords identified in analyzed publications include marketing, consumer behavior and user-generated content. A review of 48 publications that focused on the use of social media for the assessment of nature-based tourism showed that images from Flickr were most often used (36 papers), followed by Panoramio images (10 papers), and Instagram images (6 papers). Point location data collected from social media platforms were mentioned in 40 papers and temporal data in 12 papers [40].
Another study examines the role of social media, in particular, Facebook, as an educational tool in higher education [41], concluding that Facebook usage comes with benefits such as increased teacher-student and student-student interaction, improved performance, and the convenience of learning and higher engagement. A review paper [42] analyzed 412 articles on Facebook in the context of Social Science to answer various overarching questions, including: (a) Who is using Facebook? (analysis of users); (b) Why do people use Facebook? (motivation); (c) Why are people disclosing personal information on Facebook (privacy and information disclosure). An overview of social media use in college classrooms [43] concludes that LinkedIn is the most used social network by university faculty. Based on the review of 92 papers, another study analyzed which academic disciplines publish results from Twitter analysis, showing that 46% of papers come from Computer Science, followed by 30% from Information Science, and 7% from Earth- and Geoscience [44]. Furthermore, it showed that 46% of analyses fall into the category of event detection (disaster/health/disease/traffic management), 14% into social network analysis, and 13% focus on retrieving direct or indirect geolocation information from Twitter. A detailed list of publications in six leading tourism and hospitality journals that collected and analyzed online reviews from online platforms, such as Booking.com, Ctrip, Expedia, TripAdvisor, or Yelp, to address a variety of research questions, such as travel motivation, is provided elsewhere [45]. This study also reveals that online reviews could vary significantly across these platforms in terms of linguistic characteristics, semantic features, sentiment as well as impact on users of the websites (e.g., helpfulness). Fitness tracker apps, such as Strava or MapMyFitness, are primarily used to record an athlete’s sports activities in a digital diary or to draft training plans. However, these apps also have a social dimension, e.g., by allowing to share trips or photos with peers [46]. Numerous studies compare trip data from different fitness tracker apps, with the conclusion that the popularity of road network segments can be best assessed through a combination of data from different platforms [47].
GIScience has employed social media data in a variety of studies [48], for instance, by exploring ways of acquiring geospatial information from social media [49]. A systematic literature review of 690 papers in 20 social media platforms that compares methods for location extraction from social media [50] reveals that the use of social media platforms is heavily skewed towards Twitter (54.2% of papers) and Flickr (20.3% of papers) with many alternative social media platform being barely used for this purpose. It also shows that social media information has been extracted in the areas of tourism and recreation (27.2%), crisis and disaster management (12.6%), transport (9.2%), and health (8.1%), among others. Social media data is nowadays a vital source of Volunteered Geographic Information (VGI), and due to bias from social media users, data integrity is a big concern. Therefore, various quality assurance processes for VGI have been proposed including peer reviews and automated error checking [51].

3. Data and Methods

3.1. Data from Social Media Apps

A total of 110 social media apps were examined for this study (Table 1). We grouped them into the following nine thematic categories: general purpose, lifestyle, photo sharing, blogging, video sharing, business, reunion networks, gaming, and travel. The initial list of social media apps and categories were adapted from two sources [52,53]. Some of the apps listed in these sources were removed from the analysis since they were no longer active, namely 43Things, Delicious, Fotolog, Friendster, Kiwibox, Google+, Path, Sprybirds, StumbleUpon, TravBuddy, Tournac, Tout, Uplike, and Vine. New apps that were recently launched, such as Tiktok, Bumble, and Medium, were added to the list. Apps with an API are marked with an asterisk (*) in Table 1. Since the geographic prominence of an app can be influenced by the country in which it was launched, the table also provides the latter information for the listed apps. If not annotated differently, the default country where an app was launched is the United States.

3.2. Number of Related Research Papers

Three academic paper catalogs (Google Scholar, ACM Digital Library, WoS) were searched to (a) quantify research efforts related to the different social media platforms that provide an API and to (b) identify geospatial science topics covered in these research papers. Those three catalogs are commonly used for paper retrieval in review and meta-analysis studies [38,40,44].
Google Scholar is the most popular and largest search engine for scientific research books, articles, theses, abstracts, and court opinions from academic publishers, professional societies, online repositories, universities, and other web sites [54]. ACM Digital Library gives access to journals, conference proceedings, technical magazines, newsletters, books, and conference papers that cover a wide range of research themes but generally focus on computing [55]. WoS is a database that offers search capabilities for articles written back to 1900 and contains information gathered from thousands of scholarly journals, books, reports, and conferences [56]. It also has the “analyze results” feature that can be used to get an overview of thematic categories of articles, conference papers, and books that are returned from a search related to a specific social media app. This “analyze results” feature provides further information about funding agencies, type of document published (articles, reviews, or proceedings paper), countries where the research was conducted, and the authors that have investigated the topic, among others.
For the search process, the names of the social media apps with APIs (57 apps in total) were used as search parameter one after another to retrieve papers whose title, abstract, or keyword sections contained those names. Only in ACM can one conduct searches by exclusively looking up title, abstract, or keywords. Google enables searches for titles or the entire article whereas in WoS one can use the “Topic” field, which looks up the title, abstract, and keyword. One challenge in this process is semantic search limitations, especially with homonyms. For example, social media app names that are also English words such as “Discord”, “LINE”, and “Medium” led to inflated results from all the catalogs and especially for Google Scholar. To reduce the number of irrelevant hits in search results, the search term was combined with the word “app” for “Medium”, “LINE”, and “Minds” for all three search catalogs.

3.3. Analysis Methods

This research involves a variety of qualitative and quantitative methods for the analysis of 110 social media apps that are related to the three research objectives mentioned earlier.

3.3.1. Presence and Characteristics of APIs

Social media data can be acquired through web scraping, data resellers, or APIs. Examining the presence of an API is critical as it eases the data collection and integration process. API documentation provides information about data that are accessible through different API endpoints. For this research, availability of spatial data was one of the aspects of interest examined.
Through online search and exploration of API directories such as Programmableweb [57], it was determined which of the 110 apps provide an API. These shortlisted apps were subsequently closely examined with regards to API characteristics, such as rate limits.
API rate limiting affects the amount of data that can be accessed within a given time or it limits the number of requests or calls a client can make, respectively. It is critical for scalability, security, and sustainment of an API.
The API documentation was also examined for presence and type of location data that were collected by the different social media apps. Geographic data provided through social media comes typically in the form of text (place names, addresses), geographic coordinates (latitude and longitude), or coordinates of the centroid of an administrative boundary (e.g., city, county, or country). This, in turn, determines the spatial resolution of the social media data set, which will subsequently affect the type of research that can be carried out.
An independent samples T-test was conducted to examine whether apps with a larger user base were more likely to provide API data access. Similarly, a second T-test was conducted for apps with APIs to explore the relationship between user numbers and the availability of spatial data through the API.

3.3.2. Academic Research Extent and Research Themes

Social media platforms have been the subject of numerous research studies across a wide range of disciplines. Whereas especially prominent social media apps appear to have attracted the analysis of its data and users, little is known about the role of less prominent apps on research activities. To narrow this research gap and to better understand the extent and current trends in research related to these platforms, two types of analysis are conducted. First, by using the names of social media apps with APIs (see Table 1) as search criteria on Google Scholar, ACM Digital Library, and WoS, the number of published documents, as counted by these online catalogs, is retrieved and discussed. An independent samples T-Test is conducted for each of the three catalogs to examine whether a larger user base of a social media app is associated with a larger number of reported research documents. Second, to get an overview of the geospatial fields that apply social media analysis, the “analyze results” feature in the WoS platform was used for each social media app that comes with an API. This feature highlights different themes that the returned documents cover and illustrates, therefore, which social media app is associated with which discipline, including geospatial fields. The discovery of themes is based on title, abstract, and keywords of returned papers. The results of these two steps provide both quantitative and topical information about ongoing research related to a wide range of social media platforms with APIs. This information is meant to assist researchers with the identification of social media platforms that offer data related to their geospatial discipline (e.g., transportation) and in conveying a first idea about the abundance of research already conducted for each platform.
Furthermore, using title and abstract from each returned paper on WoS, a word cloud was created for selected categories of social media apps with APIs. This step highlighted terms commonly used in papers associated with different apps.

4. Results

4.1. User Statistics

The numbers of registered users for each of the 110 apps were retrieved through online searches. User statistics for the apps are summarized in Table 2, sorted in alphabetical order, together with year and month of the count. For 12 apps the number of registered users could not be determined. Facebook has the largest user base with over 2.4 billion users, followed by YouTube with 2 billion users. We hypothesize that user numbers can be used as a proxy for the availability of an API and the abundance of research literature found in relation to a social media app, which will be subsequently explored.

4.2. Availability of APIs and Spatial Data

APIs, where available, provide a gateway to social media app data either through a REST API or through GraphQL. The latter has recently become a more widely used alternative to REST APIs. Out of the 110 apps examined for this study, 57 apps provide an API (compare Table 1). Some apps provide multiple APIs for accessing different endpoints with specific datasets, as shown in Table 3.
Apps with large user numbers are more likely to provide an API than those with smaller user numbers. An independent samples T-test showed that the 53 out 57 apps with an API and known user counts had higher user numbers (Mean (M) = 345,582,491, Standard Deviation (SD) = 540,839,941) compared to the 53 apps without an API (M = 70,684,659, SD = 146,634,689), t(df) = 96, p < 0.001. Furthermore, among those apps with an API, an independent samples T-test showed that the mean number of users was higher for apps that provided spatial data in their API (M = 395,848,485, SD = 574,334,200) than for those apps that did not (M = 262,643,600, SD = 483,078,816), t(df) = 51, p = 0.015.
All APIs require users to register their apps to obtain API access credentials, such as an ID number and API key. API access requests for most apps were immediately approved, but for apps, such as Untappd or Facebook, this process takes some time.
Rate limiting methods were applied to numerous APIs (Table 3). For a few apps, such as DeviantArt, Discord, Last.fm, and LINE, which do not have a set rate limit, an effective rate limit varies depending on the frequency at which clients make requests and its effect of the app stability. For others, such as 23 and Me, eToro, and Goodreads, rate limitations were not disclosed. Doximity, Flickr, Imgur, Pinboard, and Pinterest have fixed API request rate limits. Besides setting rate limiting per given time, rate limits can also be implemented through fees. Most of the apps provide free access or a mix of free and paid subscriptions. For example, Twitter charges for access to its Premium and Enterprise APIs, whereas the Search and Streaming APIs are free of charge. Foursquare and Photobucket are free for noncommercial use but charge fees for commercial use of their APIs. As opposed to this, WhatsApp Business, Pinboard, and 500px APIs provide access through paid subscriptions only.
The apps that collect and share spatial data provide different types of location information, such as coordinates of the centroid of an administrative boundary. Based on information from the terms of service and privacy policies of various apps, location data was not provided in the APIs of a few apps even though it is collected in the background using IP addresses. In summary, 22 apps did not provide access to spatial data while 35 provided place name, geographic coordinates, or both through the API according to their API documentation (Table 3). Some APIs, such as that of Last.fm, allowed only running spatial queries, such as using the geo.getTopTracks or geo.getTopArtists API methods to query these endpoints using country names. Location data from several apps stems from user profiles only while a few apps, such as Twitter, Facebook, and Instagram provide spatial data affiliated with multimedia, such as pictures, videos, or text. 500px provides this information only once a paid subscription is made. The Flickr API is the only one that reports the accuracy of the coordinates it provides.
Although Table 3 provides information about rate limits, costs, and location data, additional limitations as to whose data can be downloaded may apply. In some cases, only one’s own data can be accessed through the API, or those from other users who gave specific permission as specified, for example, for the Facebook Graph API or the WeChat API [58,59].

4.3. Research Papers on Google Scholar, ACM Digital Library, and Web of Science

Apps such as Facebook, Instagram, Twitter, and YouTube had the highest numbers of papers published about them (Table 4), indicating that they are prominent platforms within the research community. These are also the apps with high user numbers. To quantify these results in more detail, first, the median of user numbers across social media apps in Table 2 (where available) was computed as 100,000,000. Next, each app was assigned to one of two groups, namely either the one group that contained apps with user numbers below the median or the other group with user numbers above the median. An independent samples T-test showed that in Google Scholar the apps falling into the upper user group had more paper counts (M = 331,516.9, SD = 543,184.4) compared to those in the lower user group (M = 75,528.58, SD = 227,738.4), t(df) = 49, p < 0.031. Similarly, in the ACM Digital Library the apps in the upper user group had more paper counts (M = 3766.5, SD = 6928.3) than those in the lower user group (M = 283.9, SD = 640.8), t(df) = 49, p < 0.014. Furthermore, in the WoS the apps in the upper user group had more paper counts (M = 3220.3, SD = 8115.4) than those in the lower user group (M = 139.8, SD = 320.5), t(df) = 49, p = 0.059. In the case of WoS, the level of significance for the difference in means is slightly above 5%, whereas for the other two search catalogs it is below that value.
Minds, Solaborate, Kickstarter, and Mocospace APIs were among the apps with the lowest totals across the three catalogs when queried for papers. “Discord”, “LINE”, “Medium”, which exist as English words, resulted in many irrelevant hits. XING, which is an app and a person’s name, led to unrelated results as the catalogs also tallied papers that were written by authors with that name. The latter problem could be mitigated by excluding author name in the search criteria from WoS whose results were also used to create the word cloud. Mocospace, ProductHunt, Ravelry, Solaborate, and Untappd did not have any papers affiliated with these social media platforms on WoS.
Table 4 reveals strong discrepancies in the number of returned papers between the three search portals. For each social media app, except for two platforms whose names also resemble an English term (Discord, Minds), Google Scholar by far outnumbers ACM and WoS search results. Google Scholar is one of the most widely used tools for researchers to search scientific information, and through parsing the entire Web its coverage is much more extensive than other multidisciplinary commercial databases like WoS or Scopus [60]. However, only up to 1000 search results can be displayed on the Google Scholar Website, and no API is available for additional search. Results from an earlier study show that 55% of the more than 2.6 million sample documents covered by WoS from 2009 and 2014 were freely available in some form from Google Scholar, e.g., as open access documents from publisher Websites or through links to other repositories, such as researchgate or arxiv [61]. ACM displays up to 2000 search results, where filters can be used to specify the subset of records to be displayed in cases where more than 2000 documents are returned upon a query. As opposed to this, the WoS allows to view all returned records by browsing through the result pages.

4.4. Research Themes

Research themes for papers returned from the WoS search were extracted using the “analyze results” feature. Out of the 57 apps with an API, 26 apps had spatial themes (Geography, Remote Sensing, Transportation, and Urban Planning). The frequency of the geospatial themes as found in the 3095 papers that are associated with those 26 apps are shown in Figure 1. The bars are overlaid with the number of papers that were found to be associated with each spatial theme for the different apps. The apps are sorted in descending order of the total number of papers found to be associated with a spatial theme, starting with XING on top (1113 papers), followed by Twitter (674 papers) and QQ (532 papers). Several apps had only one paper with a spatial theme, such as Tumblr or Reddit. The dominating theme is Geography, which can be found in papers for all apps listed in the chart. The other spatial themes are less frequently discussed in research outlets, i.e., Transportation for 73% of the apps, Remote Sensing for 62%, and Urban Planning for 69%. Common non-spatial themes found in papers related to apps with APIs include Materials science, Chemistry, Physics applied, Biochemistry molecular biology, Engineering electrical electronic, Oncology, Nanoscience, Communication, Computer science information systems, and Environmental Sciences. These themes are not listed in Figure 1 though.
The overview in Figure 1 adds to the existing body of knowledge by provision of topical research relevance for wide range of social media apps. As opposed to this, several earlier studies analyzed in-depth research activities associated with individual social media apps one at a time, such as Twitter [44] or Facebook [42]. These are, as can be seen in Figure 1, among the most widely analyzed social media platforms.
Figure 2 shows word clouds formed by papers from WoS that are associated with social media apps that provide APIs, where apps are grouped by theme according to Table 1. No word cloud was created for the gaming category because Mocospace, which is the only app with an API in that category, did not yield any papers from WoS. The key terms for each word cloud generally align well with the theme that the set of corresponding apps was assigned to in Table 1. For example, papers on blogging revolve around networking, social media, health, and community. Business app-related papers discuss trading, investment, decision making, and risk, whereas papers on photo sharing networks mention images, Instagram, Flickr, photo, or visual. The word clouds, therefore, demonstrate the different facets of activities and concepts that research papers discussing these apps cover in their analyses.

5. Discussion

According to Statista, more than 50% of the world’s population use social media [62]. This demonstrates the important role of social media apps today. Communication and user profile data as well as the user-generated content shared on these platforms provide a valuable resource for researchers across numerous disciplines. The user statistics summarized in this study give a good indication of the popularity of the different apps. Prominence of social apps, as it can be found, for example, for Facebook, Instagram, and YouTube, also comes with increased interest in the research community, as reflected in numerous research papers related to particularly prominent social media apps.
To stimulate innovation and help expand their business services, about half of the analyzed social media platforms give developers access to some of their data through APIs. This trend began in the early 2000s with e-commerce companies such as eBay and Amazon leading these efforts [63]. Statistical tests revealed that social media apps with a large user base also tend to provide APIs more often compared to those apps with lower user numbers. This is because development and support of an API require certain developer resources and infrastructure. In turn, provision of an API access may also lead to growth in user numbers as is the case with e-commerce platforms such as eBay because third-party developers are able to embed API functions onto their websites. In other words, companies behind apps with large user numbers have financial resources to develop and maintain APIs. Investment in an API pays off in the form of increased platform visibility and fees charged for the use of such an API for business services. For instance, Twitter as of 2011 got 15 billion calls a day on their site, three-quarter of which came through their API, and Facebook received 5 billion calls a day through its APIs [64]. All the APIs reviewed in this study required filling out an application form. For most apps, the approval process occurs almost immediately. However, for some apps, such as Untappd, the process can take up to three weeks [65] and for Instagram it can take one week or more [66]. The rigorous application process is aimed at tracking how the data acquired from these apps are used as users are asked to specify data use intentions during the application process [67]. 500 px and Pinboard are some of the apps where API users are charged a fee whereas for other apps API use was partially or entirely free. Foursquare and Twitter APIs are operated in a freemium type of model that offers basic data access for free for noncommercial use, but increased data access or commercial use are charged for. For instance, Foursquare provides twice the amount of data on the paid subscription of its API, and the premium and enterprise versions of the Twitter API give access to the full archive of tweets.
API rate limits express the allowable request per time interval and help with the stability and security of an app. Rate limiting techniques include maximum allowable requests per given time span, which, when exceeded, leads to the API calls being disconnected or the bandwidth being reduced. The Doximity API, for instance, allows its users to make 5000 API requests per hour whereas Flickr allows 3600 queries per hour. Twitter and Viber rate limits depend on the information being requested or the API call being made. Rate limits control the volume of data a developer can have access to at a given time. Access to data is also affected by a user’s privacy settings. The Facebook Graph API rate limit varies in that it implements a user level rate limiting per access token, whereas the app level limits are based on counts of the app ID being used. API level limits are based on number of API calls per second and IP level rate-limiting is based on the number of calls assigned to a particular IP address [68].
Since social media apps gather user data, in recent years, a variety of government-mandated policy changes regarding data privacy were initiated. An example is Europe’s GDPR, which aims at giving the users more control over how their data is collected, processed, stored, and transferred in the 28 member states of the European Union [69]. This has led to some social media apps restricting data access through their APIs as they changed their privacy policies to abide by the new regulation. This, in turn, affected data accessibility through the APIs. Facebook, for instance, rolled out new API restrictions in the wake of the Cambridge Analytica data scandal and GDPR which were aimed to close loopholes in the website privacy policies [36]. 500 px announced in 2018 that it will only provide paid access to its API in order to protect its data and improve its site performance [70]. Such changes in policies tend to further reduce the availability of social media app data for geospatial research because location data is widely considered personal or private data.
Location data is the backbone of geospatial research. APIs provide this data in the form of place names, geographic coordinates, address, time zone, or IP address, among others. Location data on social media can be extracted from message context or metadata, user profiles, the social network of friends and connections, tags, multimedia content (photos, videos), and links to other connected social media pages [50]. The main location types of data found on APIs in this study were place names and geographic coordinates, whereas some apps also collected IP address information. In some apps, location data is not made available through the API. For example, 23andMe, Imgur, and Last.fm collect location data from IP addresses but do not provide access to this data on their respective APIs. Additional restrictions hinder access to comprehensive location data through an API. When a user’s privacy settings do not allow sharing of this data then the information will not be retrievable. For example, the Facebook Graph API prevents the retrieval of data of a particular user due to their privacy settings [58]. In addition, a low percentage of social media content has been geotagged with geographic coordinates, for instance, only about 1-2% of tweets are geotagged [71]. The integrity of the location information extracted from social media can be questionable in some aspects as studies have found that some users provide incorrect locations intentionally in their user profiles [72].
Social media data has boosted research in various areas, which becomes evident from the number of papers retrieved from Google Scholar, the ACM Digital Library, and the WoS. The research themes on Figure 1, based on the papers retrieved from the WoS, illustrate the geospatial disciplines that have been researched in relation to the different apps. Among all 89,839 papers retrieved from the WoS for the 57 apps considered that provide an API, only 3095 are associated with geospatial disciplines. These 3095 papers are related to 26 out of the 57 apps. This low number of papers relative to the total of 89,839 papers reveals that the role of geospatial research within the entire social media research arena is a relatively small one. Instead, research in other disciplines, such as communication or computer science, more often report on social media data analysis.
There is a tendency that only a handful of apps are used for geospatial research even though other lesser known apps also provide datasets that could potentially drive research and reveal insights about various phenomena [50]. A total of 17 apps out of the 57 apps with APIs can in this study be identified as potential data sources for future research as they provide geographic data either from media or user profile information and come only with a small number of research papers published so far. These apps include DeviantArt, Discord, Doximity, eToro, Goodreads, Last.fm, LINE, LiveJournal, Mixi, MyHeritage, Photobucket, Plurk, Sina Weibo, Skyrock, Untappd, Viber, and Vkontakte. As an example of a successful attempt to retrieve geospatial information, Figure 3 depicts the distribution of location data obtained from the Plurk and DeviantArt APIs based on user profiles. The Plurk API facilitates batch processes such as adding friends or fans and getting bookmarks. The API can be also be used to look for search terms of interest which returns a JavaScript Object Notation (JSON) response with only the 20 latest plurks. The JSON report includes place names associated with the user profile locations. The red markers in Figure 3 show the profile locations of users who included the “Coronavirus” search term in their plurks. This data was accessed through the/APP/PlurkSearch/search endpoint. The DeviantArt API allows one to get the country names of watchers who are followers or fans of a DeviantArt user’s page. Using the/user/watchers/{username} endpoint facilitated the extraction of the location, i.e., the country name, of about 100 watchers of a user with the name boldfrontiers (black markers in Figure 3).

6. Conclusions

The overall goal of this study was to explore the potential of less known social media apps for geospatial research by evaluating 110 social media apps, among which more than half were found to provide APIs. APIs give developers data access and facilitate examination of the characteristics of data availed, including the provision of spatial data. As learned from the different API documentations, all the APIs considered store location data from their users in one form or another, but this information may not be available due to government regulations, app privacy policies, or a user’s privacy settings.
Recent governmental policy guidelines, such as GDPR and CCPA, have affected social media data access as they led to stricter rules in a wide range of social media apps. It will be of interest to investigate how this impacts data access and consequently research that relies on social media data in the long run. Facebook, which is one of the biggest social media platforms, introduced restrictions that curtail full data access in April of 2018 [36] as a result of GDPR. Besides this, another common obstacle in data extraction from APIs is frequent changes of policies, access regulations, functionality, or user registration and approval procedures, as well as outdated or missing API documentation. The latter necessitates substantial and time-consuming exploration, code testing, and internet search to obtain successful data access.
Among the list of 35 APIs that provide spatial data in form of place names or geographic coordinates, the spatial granularity of available data varies and, therefore, influences the type of research that it can be applied for. Spatial accuracy information from apps that provide coordinates is only given within the Flickr API. Aspects of future work include, therefore, the assessment of the quality of spatial data accessible in the 17 apps that provide location data through an API, even though they are rarely featured in geospatial research.
Social media sites collect an ever-increasing amount of data from their users, but privacy concerns may render access to these datasets impossible unless it is anonymized, and personal information is removed. Even if so, ethical use of the data is required. Therefore, the future may bring more rigorous approval procedures of API access requests with even follow-up audits to ensure that data is being used as intended. Future work will closely examine the consequences of such anticipated changes in API regulations on the social media analysis.

Author Contributions

Conceptualization, Hartwig H. Hochmair and Innocensia Owuor; methodology, Innocensia Owuor and Hartwig H. Hochmair; software, Innocensia Owuor; validation, Innocensia Owuor and Hartwig H. Hochmair; formal analysis, Innocensia Owuor; data curation, Innocensia Owuor; funding acquisition, Hartwig H. Hochmair; project administration, Hartwig H. Hochmair; resources, Hartwig H. Hochmair; supervision, Hartwig H. Hochmair; visualization, Innocensia Owuor; writing—original draft, Innocensia Owuor; writing—review and editing, Innocensia Owuor and Hartwig H. Hochmair. All authors have read and agreed to the published version of the manuscript.

Funding

We acknowledge financial support through a UF-CALS matching assistantship granted to the first author.

Acknowledgments

The authors thank two anonymous reviewers for their constructive feedback.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Spyratos, S.; Stathakis, D.; Lutz, M.; Tsinaraki, C. Using Foursquare place data for estimating building block use. Environ. Plan. B Urban Anal. City Sci. 2017, 44, 693–717. [Google Scholar] [CrossRef]
  2. Noulas, A.; Scellato, S.; Lambiotte, R.; Pontil, M.; Mascolo, C. A tale of many cities: Universal patterns in human urban mobility. PLoS ONE 2012, 7, e37027. [Google Scholar] [CrossRef]
  3. Han, S.Y.; Tsou, M.-H.; Knaap, E.; Rey, S.; Cao, G. How do cities flow in an emergency? Tracing human mobility patterns during a natural disaster with big data and geospatial data science. Urban Sci. 2019, 3, 51. [Google Scholar] [CrossRef] [Green Version]
  4. Lazer, D.M.J.; Baum, M.A.; Benkler, Y.; Berinsky, A.J.; Greenhill, K.M.; Menczer, F.; Metzger, M.J.; Nyhan, B.; Pennycook, G.; Rothschild, D.; et al. The science of fake news. Science 2018, 359, 1094–1096. [Google Scholar] [CrossRef] [PubMed]
  5. Cvetojevic, S.; Hochmair, H.H. Analyzing the spread of tweets in response to Paris attacks. Comput. Environ. Urban Syst. 2018, 71, 14–26. [Google Scholar] [CrossRef]
  6. Tenkanen, H.; Di Minin, E.; Heikinheimo, V.; Hausmann, A.; Herbst, M.; Kajala, L.; Toivonen, T. Instagram, Flickr, or Twitter: Assessing the usability of social media data for visitor monitoring in protected areas. Sci. Rep. 2017, 7, 17615. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Mislove, A.; Lehmann, S.; Ahn, Y.-Y.; Onnela, J.-P.; Rosenquist, J.N. Understanding the demographics of Twitter users. In Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, Barcelona, Catalonia, Spain, 17–21 July 2011. [Google Scholar]
  8. Smith, A.; Anderson, M. Social Media Use in 2018; Pew Research Center: Washington, DC, USA, 2018; pp. 1–4. [Google Scholar]
  9. Hou, J.; Ndasauka, Y.; Pan, X.; Chen, S.; Xu, F.; Zhang, X. Weibo or WeChat? Assessing preference for social networking sites and role of personality traits and psychological factors. Front. Psychol. 2018, 9. [Google Scholar] [CrossRef] [Green Version]
  10. Baran, K.S.; Stock, W.G. Facebook has been smacked down. The Russian special way of SNSs: Vkontakte as a case study. In Proceedings of the 2nd European Conference on Social Media (ECSM 2015), Porto, Portugal, 9–10 July 2015; pp. 574–582. [Google Scholar]
  11. Hochmair, H.H.; Zielstra, D. Analysing user contribution patterns of drone pictures to the dronestagram photo sharing portal. J. Spat. Sci. 2015, 60, 79–98. [Google Scholar] [CrossRef]
  12. Juhász, L.; Hochmair, H.H. User Contribution Patterns and Completeness Evaluation of Mapillary, a Crowdsourced Street Level Photo Service. Trans. GIS 2016, 20, 925–947. [Google Scholar] [CrossRef]
  13. Granville, K. Facebook and Cambridge Analytica: What You Need to Know as Fallout Widens. Available online: https://www.nytimes.com/2018/03/19/technology/facebook-cambridge-analytica-explained.html (accessed on 27 May 2020).
  14. Stewart, E. Why You’re Getting So Many Emails about Privacy Policies. Available online: https://www.vox.com/policy-and-politics/2018/4/5/17199754/what-is-gdpr-europe-data-privacy-facebook (accessed on 27 May 2020).
  15. Wong, Q. CCPA: What California’s New Privacy Law Means for Facebook, Twitter Users. Available online: https://www.cnet.com/news/ccpa-what-californias-new-privacy-law-means-for-facebook-twitter-users/ (accessed on 27 May 2020).
  16. Kaplan, A.M.; Haenlein, M. Users of the world, unite! The challenges and opportunities of Social Media. Bus. Horiz. 2010, 53, 59–68. [Google Scholar] [CrossRef]
  17. Kietzmann, J.H.; Hermkens, K.; McCarthy, I.P.; Silvestre, B.S. Social media? Get serious! Understanding the functional building blocks of social media. Bus. Horiz. 2011, 54, 241–251. [Google Scholar] [CrossRef] [Green Version]
  18. Sugimoto, C.R.; Work, S.; Larivière, V.; Haustein, S. Scholarly use of social media and altmetrics: A review of the literature. J. Assoc. Inf. Sci. Technol. 2017, 68, 2037–2062. [Google Scholar] [CrossRef] [Green Version]
  19. Boyd, D.M.; Ellison, N.B. Social network sites: Definition, history, and scholarship. J. Comput. Mediat. Commun. 2007, 13, 210–230. [Google Scholar] [CrossRef] [Green Version]
  20. Jones, N.; Borgman, R.; Ulusoy, E. Impact of social media on small businesses. J. Small Bus. Enterp. Dev. 2015, 22, 611–632. [Google Scholar] [CrossRef]
  21. Huang, Q.; Xiao, Y. Geographic situational awareness: Mining tweets for disaster preparedness, emergency response, impact, and recovery. ISPRS Int. J. Geo-Inf. 2015, 4, 1549–1568. [Google Scholar] [CrossRef] [Green Version]
  22. Jahanbin, K.; Rahmanian, V. Using twitter and web news mining to predict COVID-19 outbreak. Asian Pac. J. Trop. Med. 2020, 13, 4. [Google Scholar] [CrossRef]
  23. Stefanidis, A.; Vraga, E.; Lamprianidis, G.; Radzikowski, J.; Delamater, P.L.; Jacobsen, K.H.; Pfoser, D.; Croitoru, A.; Crooks, A. Zika in Twitter: Temporal variations of locations, actors, and concepts. JMIR Public Health Surveill. 2017, 3, e22. [Google Scholar] [CrossRef]
  24. Costa, S.; Meadows, J. The impact of computer usage on scholarly communication among social scientists. J. Inf. Sci. 2000, 26, 255–262. [Google Scholar] [CrossRef]
  25. Ackland, R. Social network services as data sources and platforms for e-Researching social networks. Soc. Sci. Comput. Rev. 2009, 27, 481–492. [Google Scholar] [CrossRef]
  26. Raento, M.; Oulasvirta, A.; Eagle, N. Smartphones: An emerging tool for social scientists. Sociol. Methods Res. 2009, 37, 426–454. [Google Scholar] [CrossRef] [Green Version]
  27. Internet Live Stats. Internet Usage & Social Media Statistics. Available online: https://www.internetlivestats.com/ (accessed on 27 May 2020).
  28. Venturini, T.; Rogers, R. “API-Based Research” or How can digital sociology and journalism studies learn from the facebook and cambridge analytica data breach. Digit. J. 2019, 7, 532–540. [Google Scholar] [CrossRef] [Green Version]
  29. Batrinca, B.; Treleaven, P.C. Social media analytics: A survey of techniques, tools and platforms. AI Soc. 2015, 30, 89–116. [Google Scholar] [CrossRef] [Green Version]
  30. Zhong, H.; Zhang, L.; Xie, T.; Mei, H. Inferring resource specifications from natural language API documentation. In Proceedings of the 2009 IEEE/ACM International Conference on Automated Software Engineering, Auckland, New Zealand, 16–20 November 2009; pp. 307–318. [Google Scholar]
  31. Hogan, B. Social media giveth, social media taketh away: Facebook, friendships, and APIs. Int. J. Commun. 2018, 12, 592–611. [Google Scholar]
  32. Lomborg, S.; Bechmann, A. Using APIs for Data Collection on Social Media. Inf. Soc. 2014, 30, 256–265. [Google Scholar] [CrossRef] [Green Version]
  33. Flickr. The Flickr Developer Guide—API. Available online: https://www.flickr.com/services/developer/api/ (accessed on 27 May 2020).
  34. Twitter. Rate Limiting. Available online: https://developer.twitter.com/en/docs/basics/rate-limiting (accessed on 27 May 2020).
  35. Simpson, J. Everything You Need to Know About API Rate Limiting. Available online: https://nordicapis.com/everything-you-need-to-know-about-api-rate-limiting/ (accessed on 28 May 2020).
  36. Schroepfer, M. Facebook: An Update on Our Plans to Restrict Data Access on Facebook. Available online: https://about.fb.com/news/2018/04/restricting-data-access/ (accessed on 27 May 2020).
  37. Goodreads. API. Available online: https://www.goodreads.com/api (accessed on 27 May 2020).
  38. Moorhead, S.A.; Hazlett, D.E.; Harrison, L.; Carroll, J.K.; Irwin, A.; Hoving, C. A New Dimension of Health Care: Systematic Review of the Uses, Benefits, and Limitations of Social Media for Health Communication. J. Med Internet Res. 2013, 15, e85. [Google Scholar] [CrossRef] [Green Version]
  39. Zeng, B.; Gerritsen, R. What do we know about social media in tourism? A review. Tour. Manag. Perspect. 2014, 10, 27–36. [Google Scholar] [CrossRef]
  40. Teles da Mota, V.; Pickering, C. Using social media to assess nature-based tourism: Current research and future trends. J. Outdoor Recreat. Tour. 2020, 30, 100295. [Google Scholar] [CrossRef]
  41. Chugh, R.; Ruhi, U. Social media in higher education: A literature review of Facebook. Educ. Inf. Technol. 2018, 23, 605–616. [Google Scholar] [CrossRef]
  42. Wilson, R.E.; Gosling, S.D.; Graham, L.T. A review of Facebook research in the social sciences. Perspect. Psychol. Sci. 2012, 7, 203–220. [Google Scholar] [CrossRef] [Green Version]
  43. Cooper, B.; Naatus, M.K. Linkedin as a learning tool in business education. Am. J. Bus. Educ. (AJBE) 2014, 7, 299–306. [Google Scholar] [CrossRef]
  44. Steiger, E.; de Albuquerque, J.P.; Zipf, A. An Advanced Systematic Literature Review on Spatiotemporal Analyses of Twitter Data. Trans. GIS 2015, 19, 809–834. [Google Scholar] [CrossRef]
  45. Xiang, Z.; Du, Q.; Ma, Y.; Fan, W. A comparative analysis of major online review platforms: Implications for social media analytics in hospitality and tourism. Tour. Manag. 2017, 58, 51–65. [Google Scholar] [CrossRef]
  46. Romanillos, G.; Austwick, M.Z.; Ettema, D.; Kruijf, J.D. Big data and cycling. Transp. Rev. 2016, 36, 114–133. [Google Scholar] [CrossRef]
  47. Norman, P.; Pickering, C.M. Using volunteered geographic information to assess park visitation: Comparing three on-line platforms. Appl. Geogr. 2017, 89, 163–172. [Google Scholar] [CrossRef]
  48. Sui, D.; Goodchild, M. The convergence of GIS and social media: Challenges for GIScience. Int. J. Geogr. Inf. Sci. 2011, 25, 1737–1748. [Google Scholar] [CrossRef]
  49. Stefanidis, A.; Crooks, A.; Radzikowski, J. Harvesting ambient geospatial information from social media feeds. GeoJournal 2013, 78, 319–338. [Google Scholar] [CrossRef]
  50. Stock, K. Mining location from social media: A systematic review. Comput. Environ. Urban Syst. 2018, 71, 209–240. [Google Scholar] [CrossRef]
  51. Yan, Y.; Feng, C.-C.; Huang, W.; Fan, H.; Wang, Y.-C.; Zipf, A. Volunteered geographic information research in the first decade: A narrative review of selected journal articles in GIScience. Int. J. Geogr. Inf. Sci. 2020, 34, 1765–1791. [Google Scholar] [CrossRef]
  52. Mehra, G. 105 Leading Social Networks Worldwide. Available online: https://www.practicalecommerce.com/105-leading-social-networks-worldwide (accessed on 27 May 2020).
  53. Spencer, J. 65+ Social Networking Sites You Need to Know About in 2020. Available online: https://makeawebsitehub.com/social-media-sites/ (accessed on 27 May 2020).
  54. Google Scholar. About Google Scholar. Available online: https://0-scholar-google-com.brum.beds.ac.uk/intl/en/scholar/about.html (accessed on 24 August 2020).
  55. ACM Digital Library. About ACM DL. Available online: https://0-dl-acm-org.brum.beds.ac.uk/about (accessed on 24 August 2020).
  56. Web of Science. Web of Science Core Collection Help. Available online: http://0-images-webofknowledge-com.brum.beds.ac.uk//WOKRS535R90/help/WOS/hp_database.html (accessed on 24 August 2020).
  57. ProgrammableWeb. ProgrammableWeb. Available online: https://www.programmableweb.com/ (accessed on 19 August 2020).
  58. Facebook. Using the Graph API Documentation. Available online: https://developers.facebook.com/docs/graph-api/using-graph-api/ (accessed on 19 August 2020).
  59. Weixin. wx.getLocation (Object object). Weixin Public doc. Available online: https://developers.weixin.qq.com/miniprogram/en/dev/api/location/wx.getLocation.html (accessed on 24 August 2020).
  60. Orduna-Malea, E.; Ayllón, J.M.; Martín-Martín, A.; Delgado López-Cózar, E. Methods for estimating the size of Google Scholar. Scientometrics 2015, 104, 931–949. [Google Scholar] [CrossRef] [Green Version]
  61. Martín-Martín, A.; Costas, R.; van Leeuwen, T.; Delgado López-Cózar, E. Evidence of open access of scientific publications in Google Scholar: A large-scale analysis. J. Informetr. 2018. [Google Scholar] [CrossRef] [Green Version]
  62. Clement, J. Number of Social Media Users Worldwide. Available online: https://0-www-statista-com.brum.beds.ac.uk/statistics/278414/number-of-worldwide-social-network-users/ (accessed on 19 August 2020).
  63. API Evangelist. History of APIs. Available online: https://history.apievangelist.com/ (accessed on 19 August 2020).
  64. Jacobson, D.; Woods, D.; Brail, G. APIs: A Strategy Guide; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2011; ISBN 978-1-4493-0892-6. [Google Scholar]
  65. Untappd. Untappd API Registration. Available online: https://untappd.com/api/register?notoken=true (accessed on 27 May 2020).
  66. Instagram. Instagram Developer Documentation. Available online: https://www.instagram.com/developer/ (accessed on 25 August 2020).
  67. Facebook. Facebook for Developers: Introduction. Available online: https://developers.facebook.com/docs/app-review/introduction/ (accessed on 19 August 2020).
  68. Sloan, L.; Quan-Haase, A. The SAGE Handbook of Social Media Research Methods; SAGE: London, UK, 2017; ISBN 978-1-4739-8721-0. [Google Scholar]
  69. Wolford, B. What is GDPR, the EU’s New Data Protection Law? Available online: https://gdpr.eu/what-is-gdpr/ (accessed on 19 August 2020).
  70. 500px. 500px Support. Available online: http://support.500px.com/hc/en-us/articles/360002435653 (accessed on 27 May 2020).
  71. Kotzias, D.; Lappas, T.; Gunopulos, D. Addressing the sparsity of location information on Twitter. In Proceedings of the EDBT/ICDT Workshops, Athens, Greece, 28 March 2014; pp. 339–346. [Google Scholar]
  72. Hecht, B.; Hong, L.; Suh, B.; Chi, E.H. Tweets from Justin Bieber’s heart: The dynamics of the location field in user profiles. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Association for Computing Machinery, New York, NY, USA, 7–12 May 2011; pp. 237–246. [Google Scholar]
Figure 1. Spatial themes of papers associated with social media apps.
Figure 1. Spatial themes of papers associated with social media apps.
Ijgi 09 00526 g001
Figure 2. Word clouds for different categories of social media apps.
Figure 2. Word clouds for different categories of social media apps.
Ijgi 09 00526 g002
Figure 3. Location data from Plurk and DeviantArt.
Figure 3. Location data from Plurk and DeviantArt.
Ijgi 09 00526 g003
Table 1. Social media apps categorized by theme together with the country of the app launch (AR…Argentina, CA…Canada, CN…China, DE…Germany, FR…France, IL…Israel, JP…Japan, KR…Korea, LU…Luxembourg, RU…Russia, TW…Taiwan, UK…United Kingdom), where * indicates availability of an API.
Table 1. Social media apps categorized by theme together with the country of the app launch (AR…Argentina, CA…Canada, CN…China, DE…Germany, FR…France, IL…Israel, JP…Japan, KR…Korea, LU…Luxembourg, RU…Russia, TW…Taiwan, UK…United Kingdom), where * indicates availability of an API.
General Purpose Social Networks
AngelListGabFacebook Messenger * QQ * CNSkyrock * FRTwoo
Badoo * UKGirlsAskGuysMixi * JPQuoraSnapchat * Vero *
Baidu Tieba * CNKickstarter * MyspaceQZoneSpreelyViber * LU
Discord * LINE * KRNextdoorReddit * TaggedVkontakte * RU
DoubanLinkedIn * Pinboard * RenrenTelegram * DEWeChat * CN
Facebook * MeetMePinterest * Sina Weibo * CNThe DotsWhatsApp *
Foursquare * Meetup * ProductHunt * Skype * Twitter * Yelp *
Lifestyle
Academia.eduCafeMomEnglish,baby!InternationsPlentyofFishUntappd *
ASmallWorldCare2FlixsterItalki.com Ravelry * VampireFreaks
BlackPlanetCaringBridgeGaiaLast.fm * UK ReverbNation
BumbleCrokesGlocalsMinds * SoundCloud * DE
BusuuCross.tvGoFundMe* MyMFBTaringa! * AR
BuzznetDoximity* Goodreads* NexopiaTinder
Photo sharing
500px * CADronestagramFotkiInstagram * Pixabay * DESnapfish
DeviantArt * Flickr * Imgur * Photobucket * Shutterfly * WeHeartIt
Blogging
LiveJournal * RUMedium * Plurk * TWTumblr * Xanga
Video sharing
FunnyOrDieTiktokYouTube * YY
Business
eToro * ILRyzeSolaborate * Viadeo * FRXING * DE
Reunion networks
23andMe * Ancestry.comClassmatesMyHeritage * IL
Gaming
CellufunHabboMocoSpace * Zynga
Travel
CouchSurfingWayn
Table 2. User statistics.
Table 2. User statistics.
Social Media Sites with Approximate User Numbers
23andMe-10 mio (April 2020)GoFundMe-70 mio (April 2020)ReverbNation-4 mio (2017)
500px-15 mio (April 2020)Goodreads-90 mio (April 2020)Ryze-1 mio (April 2020)
Academia.edu-122 mio (April 2020)Habbo-120 mio (2019)Sina Weibo-497 mio (April 2020)
Ancestry.com-3 mio (April 2020)Imgur-300 mio (April 2020)Shutterfly-10 mio (2018)
AngelList-10 mio (2019)Internations-3.6 mio (2019)Skype-300 mio (2019)
ASmallWorld-28,500 (2018)Instagram-1 bio (April 2020)Skyrock-Unknown
Badoo-471 mio (April 2020)Italki.com-5 mio (April 2020)Snapchat-382 mio (April 2020)
Baidu Tieba (Postbar)-1.5 bio (2017)Kickstarter-18 mio (April 2020)Solaborate- Unknown
BlackPlanet- 15.8 mio (2007)Last.fm-60 mio (April 2020)SoundCloud-175 mio (April 2020)
Bumble-5.03 mio (2020)LINE-600 mio (2017)Snapfish-100 mio (April 2020)
Busuu-100 mio (2020)LinkedIn-675 mio (April 2020)Spreely-Unknown
Buzznet-UnknownLive Journal-UnknownTagged-20 mio (2017)
Cafemom-146 mio (April 2020)Medium-60 mio (2017)Taringa!-30 mio (April 2020)
Care2-55 mio (April 2020)MeetMe-UnknownTelegram-200 mio (April 2020)
Cellufun-2 mio (2017)Meetup-25.5 mio (2016)The Dots-300,000 (2018)
Classmates-70 mio (2015)Facebook Messenger-1.3 bio (April 2020)Tiktok-800 mio (April 2020)
CaringBridge-500,000 (2013)Mixi-14 mio (2013)Tinder-50 mio (2018)
Crokes-46,000 (April 2020)Minds-1.25 mio (2018)Tumblr-23 mio (2019)
Cross.tv-743,829 (April 2020)MocoSpace-2 mio (2017)Twitter-340 mio (April 2020)
Couchsurfing-12 mio (April 2020)MyMFB (LABYRA)-UnknownTwoo-13 mio (April 2020)
DeviantArt-48 mio (April 2020)MyHeritage-106 mio (April 2020)Untappd-5 mio (2017)
Discord-250 mio (2020)myspace-50 mio (2015)VampireFreaks-2 Million (2009)
Dronestagram-30,000 (2016)Nexopia-1.4 mio (2012)Vero-3 mio (2018)
Douban-60 mio (2017)Nextdoor-UnknownViadeo-50 mio (2017)
Doximity-1 mio (2017)Photobucket-90 mio (April 2020)Viber-1.1 bio (2019)
eToro-4.5 mio (2019)Pinboard-22,000 (2013)VKontakte (VK)-400 mio (2017)
English, baby!-1.6 mio (2012)Pinterest-322 mio (2020)Wayn-20 mio (2017)
Facebook-2.449 bio (2020)Pixabay-UnknownWe Heart It-40 mio (2015)
Flickr-100 mio (April 2020)PlentyofFish-150 mio (April 2020)WeChat-1.151 bio (April 2020)
Flixster (Fandango)-67 mio (April 2020)Plurk-100 mio (2013)Whatsapp-1.6 bio (April 2020)
Fotki-UnknownProductHunt-600,000 (2018)Xanga-27 mio (2006)
Foursquare-50 mio (April 2020)QQ-731 mio (April 2020)XING-10 mio (2016)
Funny or Die-2,830,000 (April 2020)Quora-300 mio (2018)Yelp-176 mio (2019)
Gab-200,000 (2017)QZone-517 mio (April 2020)YouTube-2 bio (April 2020)
Gaia Online-31 (April 2020)Ravelry-9 mio (April 2020)YY-Unknown
GirlsAskGuys-UnknownReddit-430 mio (April 2020)Zynga-232 mio (2019)
Glocals-152,790 (April 2020)Renren-31 mio (2018)
Table 3. Characteristics of APIs and location data.
Table 3. Characteristics of APIs and location data.
SiteRate LimitsCostsAPI Name(s)Location Data
23andMeNot specifiedFree23andMe Collected but not available through API
500pxPay to accessPaid500px Available after paid subscription
BadooNot specifiedFreeBadooNot specified
Baidu TiebaNot specifiedNot specifiedBaidu Tieba Not specified
DeviantArtVariesFreeDeviantArt & Sta.sh, oEmbed, RSSPlace name
DiscordVariesFreeDiscord Place name
Doximity5000 requests per hourFreeDoximity Place name, coordinates
eToroNot specifiedFreeDiscovery, Metadata, System, User Place name
FacebookVariesFreeGraph, Marketing, Live Video, Pages Place name, coordinates
Flickr3600 queries per hourFreeFlickr Rest Coordinates
Foursquare VariesFree and paidPlaces Place name, coordinates
GoodreadsNot specifiedFreeGoodreads Place name/query per location
GoFundMeNot specifiedFreeGoFundme Place name
Imgur12,500 requests per dayFreeImgur Collected but not available through API
InstagramVariesFree Basic Display, Graph Coordinates
KickstarterNot specifiedFreeKickstarter StatusNot available
Last.fmVariesFreeLast.fm Collected but not available through API (query per location)
LINEVariesFree and paidMessaging Place name, coordinates
LinkedInVariesFreeLinkedIn Place name
LiveJournalVariesFreeLiveJournal Place name
MediumNot specifiedFreeMedium Collected but not available through API
MeetupNot specifiedFreeMeetup Rest Place name, coordinates
Facebook MessengerVariesFreeAttachment Upload, Broadcast, Customer Matching, Handover Protocol, ID Matching, Messenger Code,
Messaging Insights, Messenger Profile, Personas, Send, Messaging Feature Review
Coordinates
MixiNot specifiedFreeGraph -People, Groups, People lookup, Voice, Updates, Check, Photo, Message, Diary, Check-in, Profile Image, Persistence, Calendar Place name, coordinates
MocoSpaceNot specifiedFreeMocoSpaceNot specified
MindsNot specifiedFreeMinds Not specified
MyHeritageNot specifiedFreeFamily Graph Place name
PhotobucketVariesFree and paidPhotobucket Coordinates
Pinboard1 call per user every 3 sPaidPinboard Not specified
PinterestApproved app can make 100 calls/day FreePinterest Place name
Pixabay5000 requests per hourFreePixabay Collected but not available through API (query per location)
PlurkNot specifiedFreePlurk Place Name
ProductHuntVariesFreeProductHunt GraphQL Collected but not available through API
QQNot specifiedFreeUser Information, Relationship Chain, Application promotion, Payment Not specified
RavelryNot specifiedFreeRavelry Collected but not available through API
Reddit60 requests per minuteFreeReddit Place name
Sina WeiboVariesFreeSina Weibo Rest, Search, Geo, Test Place name, coordinates
ShutterflyNot specifiedNot specifiedShutterfly Collected but not available through API
SkypeNot specifiedFreeSkypePlace name
SkyrockVariesFreeSkyrock Place name, coordinates
SolaborateNot specifiedFreeSolaborateNot specified
SoundCloudVariesFreeSoundCloud Place name, coordinates
SnapchatNot specifiedFreeSnap KitCollected but not available through API
Taringa!Not specifiedFreeTaringa! Rest Not specified
TelegramVariesFreeTelegram Rest, Bot Place name
TumblrVariesFreeTumblr Place name
TwitterVariesFree and paidAds, Search Tweets, Streaming, and Direct Message.Place name, coordinates
Untappd100 calls per keyFreeUntappd Place name, coordinates
VeroNot specifiedNot specifiedVero Collected but not available through API
ViadeoLast 50 items of endpointsFreeViadeo Collected but not available through API
ViberVariesFreeViber Place name, coordinates
VKontakte VariesFreeVKontakte Place name
WeChatNot specifiedNot specifiedWeChat Place name, coordinates
WhatsAppVariesPaidWhatsApp BusinessNot specified
XINGVariesNot specifiedXING E-RecruitingPlace name
Yelp5000 API calls per dayFreeFusion Place name, coordinates
YouTubeVariesFreeIframe Player, Data, Analytics, Live Streaming, ReportingPlace name
Table 4. Number of research papers associated with the different social media platforms.
Table 4. Number of research papers associated with the different social media platforms.
SiteGoogleACMWoSSiteGoogleACMWoS
23andMe959050656Pinterest274,000714214
500px2910364Pixabay25,4003624
Badoo47803317Plurk15,6003716
Baidu Tieba 16801273 8ProductHunt33200
DeviantArt55607212QQ1,010,000243237503
Discord3344095Ravelry889300
Doximity1090168Reddit307,0001353267
eToro128027Shutterfly1140326
Facebook1,830,00022,10815,663Sina Weibo32,1001430392
Facebook Messenger291,00024,14478Skype76,7003333918
Flickr139,0005485924Skyrock442083
Foursquare23,0001846345Snapchat18,500659234
Goodreads14,9009946Solaborate900
GoFundMe32402929SoundCloud138,80015420
Imgur4200589Taringa!78602204
Instagram194,00025031492Telegram68,000342312
Kickstarter15,500310186Tumblr229,000563183
Last.fm15,000826102Twitter1,700,0001723512111
LINE 13302311Untappd23230
LinkedIn525,0002998603Vero556,0001258
LiveJournal24,10090573Viadeo359043
Medium 41132Viber15,60014941
Meetup11,60032650VKontakte 16,0006995
Minds 100 158WeChat345,000707731
Mixi12,40012911WhatsApp39,6001347954
Mocospace17930XING1,040,00027731540
MyHeritage14,100910Yelp20,5001342349
Photobucket5210712YouTube1,360,00097444381
Pinboard28204714

Share and Cite

MDPI and ACS Style

Owuor, I.; Hochmair, H.H. An Overview of Social Media Apps and their Potential Role in Geospatial Research. ISPRS Int. J. Geo-Inf. 2020, 9, 526. https://0-doi-org.brum.beds.ac.uk/10.3390/ijgi9090526

AMA Style

Owuor I, Hochmair HH. An Overview of Social Media Apps and their Potential Role in Geospatial Research. ISPRS International Journal of Geo-Information. 2020; 9(9):526. https://0-doi-org.brum.beds.ac.uk/10.3390/ijgi9090526

Chicago/Turabian Style

Owuor, Innocensia, and Hartwig H. Hochmair. 2020. "An Overview of Social Media Apps and their Potential Role in Geospatial Research" ISPRS International Journal of Geo-Information 9, no. 9: 526. https://0-doi-org.brum.beds.ac.uk/10.3390/ijgi9090526

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop