superinference
v0.2.11
Published
Swiss army knife for inferring analysis-ready attributes from API endpoints and returning them as JSON objects
Superinference
Superinference is a library that infers analysis-ready attributes from a person's social media username or unique identifier and returns them as JSON objects.
It supports both token-based (OAuth) authorization for authenticated requests and unauthenticated requests for public data. It currently supports the following social media channels:
- [x] GitHub
- [x] Dev.to
- [ ] Medium
- [ ] WordPress
Live Demo:
Use this CodeSandbox to experiment with Superinference quickly. A Python version of Superinference is also available, maintained by tmtsmrsl.
Use Cases
You might use Superinference to generate profile pages for your app's users, or to enrich your user data with additional information inferred from their social media accounts. You might also use it to speed up account creation by directly inferring attributes such as a user's email address, name, and profile picture.
Installation
You can install the package using yarn or npm:
yarn add superinference
# or:
npm install superinference
This will install superinference from npm and add it to your package.json file.
Usage
Common Patterns
There is nothing magic here. Superinference is just a wrapper around the social media APIs, so its usage is very simple. Here is how you would extract and infer attributes for a person from their GitHub and Dev.to usernames, and for a repository from its name and owner:
import { inferFromDevto, inferFromGithub, inferFromGithubRepo } from "superinference";
// using then
let profile, skill, stats, contribution, bio, repo;
inferFromGithub({ githubHandle:"AurelliaChristie" }).then((data) => {
// do something with the data, such as setting states or updating UI
({ profile, skill, stats, contribution } = data);
});
inferFromDevto({ devtoHandle:"onlyphantom" }).then((data) => {
({ bio } = data);
});
inferFromGithubRepo({
repositoryName: "generations-frontend",
repositoryOwner: "onlyphantom",
}).then((data) => {
repo = data;
});
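// using await (an alternative to the then-based version above, not an addition to it:
// the same names are declared again, so use it in a separate scope,
// inside an async function or an ES module)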
const { profile, skill, stats, contribution } = await inferFromGithub({ githubHandle:"AurelliaChristie" });
const bio = await inferFromDevto({ devtoHandle:"onlyphantom" });
const repo = await inferFromGithubRepo({
repositoryName: "generations-frontend",
repositoryOwner: "onlyphantom",
});
Here are sample results for profile, skill, stats, contribution, bio, and repo, as returned by the three function calls above:
// profile
{
"login": "AurelliaChristie",
"name": "Aurellia Christie",
"company": "@supertypeai ",
"blog": "",
"location": null,
"email": null,
"hireable": null,
"twitter_username": null,
"avatar_url": "https://avatars.githubusercontent.com/u/69672839?v=4",
"bio": "Full Stack Data Scientist at @supertypeai",
"followers": 8,
"following": 8
}
// skill (based on the user's owned repositories data)
{
"inference_from_originalrepo_count": 17,
"key_qualifications": [ "consultancy", "full-stack-developer" ],
"top_n_languages": [ "html", "javascript", "python" ],
"languages_percentage": {
// only available for authenticated requests;
// otherwise this field is null
"html": 0.500,
"javascript": 0.333,
"python": 0.278,
"css": 0.222,
"r": 0.111,
"jupyter-notebook": 0.056
}
}
// stats (repositories)
{
"incomplete_repo_results": false,
"inference_from_repo_count": 26,
"original_repo_count": 18,
"forked_repo_count": 9,
"stargazers_count": 2,
"forks_count": 5,
"top_repo_stars_forks": [
{
"name": "Ad-Fatigued-List-Generator",
"html_url": "https://github.com/AurelliaChristie/Ad-Fatigued-List-Generator",
"description": null,
"top_language": "Python",
"stargazers_count": 0,
"forks_count": 1
},
{
"name": "BeautIndonesia",
"html_url": "https://github.com/AurelliaChristie/BeautIndonesia",
...
},
{
"name": "cryptocurrency",
"html_url": "https://github.com/AurelliaChristie/cryptocurrency",
...
}
]
}
// contribution (only available for authenticated requests; otherwise null)
{
"contribution_count": 791,
"weekly_average_contribution": 5.774,
"contribution_count_per_day": {
"Wed": [ 101, 155 ], // the first value represents the contributions count in the last 12 months, while the second value represents the contributions count of all time
"Thu": [ 74, 163 ],
"Fri": [ 65, 180 ],
"Mon": [ 43, 92 ],
"Tue": [ 43, 104 ],
"Sun": [ 10, 51 ],
"Sat": [ 6, 46 ]
},
"contribution_count_per_month": {
"Mar": [ 119, 124 ], // the first value represents the contributions count in the last 12 months, while the second value represents the contributions count of all time
"Feb": [ 88, 88 ],
"Aug": [ 31, 37 ],
"Oct": [ 24, 159 ],
"Sep": [ 19, 73 ],
"Jul": [ 19, 47 ],
"Jun": [ 18, 18 ],
"Nov": [ 15, 74 ],
"Jan": [ 6, 134 ],
"Dec": [ 3, 37 ],
"Apr": [ 0, 0 ],
"May": [ 0, 0 ]
},
// the following 4 properties are inferred from the top 100 repos per year based on the total contributions count
"contribution_count_per_owned_repo": {
"BeautIndonesia": 84,
"TWO": 52,
"Skilvul-Tech4impact": 34,
"Ad-Fatigued-List-Generator": 32,
"Inventory-Management": 15,
"21_JSIntermediate_Code_Challenge": 15,
"Skilvul-Git-Second-Assignment": 14,
"Learning-Django": 9,
"Statistics-and-Microsoft-Excel-101": 7,
"Using-R-for-Time-Series-Stock-Analysis": 5,
"Multivariate-Analysis-McD-and-KFC-Nutrition-Facts": 5,
"cryptocurrency": 4,
"AurelliaChristie": 3,
"express-heroku-todolist": 2,
"dashboard-training": 2,
"Documentations": 2,
"supertype-fellowship": 1
},
"contribution_count_per_other_repo": [
{
"name": "generations-frontend",
"owner": "onlyphantom",
"html_url": "https://github.com/onlyphantom/generations-frontend",
"description": "Front end for Fellowship by @supertypeai",
"top_language": "javascript",
"contributions_count": 176
},
{
"name": "CookInd",
"owner": "Tech4Impact-21-22",
...
},
...
],
"contribution_count_per_repo_org_owner": {
"Tech4Impact-21-22": 124,
"supertypeai": 107,
"olahdata-ai": 2,
"impactbyte": 1,
"supabase": 1
},
"contribution_count_per_repo_user_owner": {
"AurelliaChristie": 286,
"onlyphantom": 215,
"Lathh": 18,
"vccalvin33": 9
},
// incoming contribution count (commits and pull requests from other users)
// only based on the top and latest 10 repositories
"external_contribution_to_top_10_repo": { "geraldbryan": 42 }
}
// bio
{
"username": "onlyphantom",
"name": "Samuel Chan",
"twitter_username": "_onlyphantom",
"github_username": "onlyphantom",
"summary": "Co-founder of Algoritma, a data science academy; https://supertype.ai, a full-cycle data science agency;",
"location": "Indonesia / Singapore",
"website_url": "https://www.youtube.com/samuelchan",
"joined_at": "Jul 3, 2019",
"profile_image": "https://res.cloudinary.com/practicaldev/image/fetch/...
}
// repo
{
"name": "generations-frontend",
"html_url": "https://github.com/onlyphantom/generations-frontend",
"description": "Front end for Fellowship by @supertypeai",
"owner_username": "onlyphantom",
"owner_html_url": "https://github.com/onlyphantom",
"topic": [],
"visibility": "public",
"created_at": "2022-05-10T15:58:48Z",
"last_pushed_at": "2023-03-20T10:54:51Z",
"top_language": "JavaScript",
"languages_percentage": [
{
"JavaScript": 0.966
},
{
"CSS": 0.018
},
{
"HTML": 0.016
}
],
"stargazers_count": 3,
"forks_count": 2,
"watchers_count": 3,
"subscribers_count": 2,
"open_issues_count": 0,
"incomplete_contribution_results": false,
"contributors_count": 2,
"contributions_count": 303,
"contributions": [
{
"contributor_username": "onlyphantom",
"contributor_html_url": "https://github.com/onlyphantom",
"contributor_repos_url": "https://api.github.com/users/onlyphantom/repos",
"contributor_type": "User",
"contributions": 183,
"contributions_percentage": 0.604
},
{
"contributor_username": "AurelliaChristie",
"contributor_html_url": "https://github.com/AurelliaChristie",
"contributor_repos_url": "https://api.github.com/users/AurelliaChristie/repos",
"contributor_type": "User",
"contributions": 120,
"contributions_percentage": 0.396
}
],
"incomplete_event_results": false,
"events_count": {
"PushEvent": 145,
"PullRequestEvent": 70,
"IssueCommentEvent": 34,
"GollumEvent": 1,
"ForkEvent": 1,
"CreateEvent": 1,
"DeleteEvent": 1
}
}
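The sample responses above can be consumed with a few small helpers. The following is a minimal sketch, not part of Superinference: `rankLanguages`, `flattenRepoLanguages`, and `busiestDay` are hypothetical names, and the code assumes the response shapes match the samples above, including `null` for fields that need an authenticated request.

```javascript
// Hypothetical helpers (not part of superinference). They assume the
// response shapes shown in the samples above.

// skill.languages_percentage is an object, or null for unauthenticated
// requests. Returns [language, share] pairs sorted by share, descending.
function rankLanguages(skill) {
  const shares = (skill && skill.languages_percentage) || {};
  return Object.entries(shares).sort(([, a], [, b]) => b - a);
}

// repo.languages_percentage is an array of single-key objects.
// Merges it into one { language: share } object.
function flattenRepoLanguages(repo) {
  const parts = (repo && repo.languages_percentage) || [];
  return Object.assign({}, ...parts);
}

// contribution.contribution_count_per_day maps a day name to
// [count in the last 12 months, all-time count]. Returns the day with the
// most contributions in the last 12 months, or null if there is no data.
function busiestDay(contribution) {
  if (!contribution || !contribution.contribution_count_per_day) return null;
  const days = Object.entries(contribution.contribution_count_per_day);
  if (days.length === 0) return null;
  return days.reduce((best, day) => (day[1][0] > best[1][0] ? day : best))[0];
}
```

Because `contribution` and `skill.languages_percentage` are `null` for unauthenticated requests, each helper falls back to an empty result instead of throwing.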
Authenticated Requests
The calls in the code example above are unauthenticated requests, so they collect data from public profiles and return only information that is available to the public.
You can optionally pass in an OAuth token to make authenticated requests. In the case of GitHub, this also lets you extract and infer stats from private repositories that are not available to the public.
inferFromGithub({ githubHandle:"onlyphantom", token:oauth_token, include_private:true, top_repo_n:10, top_language_n:5 })
This returns the top 10 repositories, including private ones, and the top 5 languages using a GitHub OAuth token.
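One way to keep the token out of your source code is to read it from the environment and fall back to an unauthenticated request when it is missing. The sketch below assumes a Node.js environment; `buildGithubOptions` is a hypothetical helper and `GITHUB_TOKEN` is an assumed environment variable name, neither of which is part of Superinference. The option names are the ones used in the call above.

```javascript
// Hypothetical helper (not part of superinference): builds the options
// object for inferFromGithub. Without a token it requests public data only;
// with a token it uses the same options as the authenticated call above.
function buildGithubOptions(githubHandle, token) {
  if (!token) {
    return { githubHandle };
  }
  return {
    githubHandle,
    token,
    include_private: true,
    top_repo_n: 10,
    top_language_n: 5,
  };
}

// GITHUB_TOKEN is an assumed environment variable name.
const githubOptions = buildGithubOptions("onlyphantom", process.env.GITHUB_TOKEN);
// Pass the result to the library, e.g. inferFromGithub(githubOptions)
```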
API Rate Limit
The APIs we use limit the number of requests that can be made within a set timeframe. If this limit is exceeded, Superinference stops making further requests and returns the inference based on the data collected so far. To check whether this happened, inspect the following fields in the response:
- incomplete_<item>_results: Boolean that indicates whether the results for <item> are incomplete because the API rate limit was reached.
- inference_from_<item>_count: The number of <item> retrieved from the API (before the API rate limit was reached).
Special note for the GitHub API: it returns a maximum of 1,000 results (10 pages) per endpoint. As a result, you may see incomplete_<item>_results set to false while inference_from_<item>_count is 1,000, even though more than 1,000 <items> actually exist.
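The two checks described above can be combined into one pass over a response section. The following is a minimal sketch, assuming the field naming shown above (`incomplete_<item>_results` and `inference_from_<item>_count`); `findIncompleteResults` is a hypothetical helper, not part of Superinference.

```javascript
// Hypothetical helper (not part of superinference): scans one section of a
// response (for example `stats` or `repo`) for signs of incomplete data.
const GITHUB_RESULT_CAP = 1000; // per-endpoint cap described above

function findIncompleteResults(section) {
  const warnings = [];
  for (const [key, value] of Object.entries(section || {})) {
    // incomplete_<item>_results is true when the rate limit cut collection short
    const incomplete = key.match(/^incomplete_(.+)_results$/);
    if (incomplete && value === true) {
      warnings.push(`${incomplete[1]}: stopped early by the API rate limit`);
    }
    // A count at the cap may hide further items even if the flag is false
    const counted = key.match(/^inference_from_(.+)_count$/);
    if (counted && value >= GITHUB_RESULT_CAP) {
      warnings.push(`${counted[1]}: reached the ${GITHUB_RESULT_CAP}-result cap, may be truncated`);
    }
  }
  return warnings;
}
```

An empty array means neither condition was detected in that section. It does not guarantee that the data is complete for other reasons.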