I Failed the Turing Test

  • Web Development
  • Database
  • DevOps
  • Python
  • JavaScript
  • Machine Learning
  • Big Data
  • Blockchain

All posts by Vinta

Build a recommender system with Spark: Content-based and Elasticsearch

Posted on 2017-10-102019-10-22Author VintaPosted in Big Data, Machine Learning

在本篇文章中,我們以 Candidate Generation 階段常用的方法之一:Content-based recommendation 基於內容的推薦為例,利用 Elasticsearch 的 More Like This query 建立一個 GitHub repositories 的推薦系統,以用戶最近打星過的 repo 作為輸入數據,比對出相似的其他 repo 作為候選物品集。

Spark ML cookbook (Scala)

Posted on 2017-09-022019-10-22Author VintaPosted in Big Data, Machine Learning

Scala is the first class citizen language for interacting with Apache Spark, but it's difficult to learn. This article is mostly about Spark ML - the new Spark Machine Learning library which was rewritten in DataFrame-based API.

Spark SQL cookbook (Scala)

Posted on 2017-09-022019-10-22Author VintaPosted in Big Data, Machine Learning

Scala is the first class citizen language for interacting with Apache Spark, but it's difficult to learn. This article is mostly about operating DataFrame or Dataset in Spark SQL.

Play with GitHub Archive Dataset on BigQuery

Posted on 2017-09-012019-10-29Author VintaPosted in Big Data

Google BigQuery is a web service that lets you do interactive analysis of very massive datasets - analyzing billions of rows in seconds.

Use Makefile as a task runner for arbitrary projects

Posted on 2017-08-312019-10-22Author VintaPosted in DevOps

Use the make, Luke!

Posts pagination

← Previous Page 1 … Page 9 Page 10 Page 11 … Page 24 Next →

Say Hello

TwitterFacebookLinkedInGitHubRSS

Recent Posts

  • GKE Autopilot Cluster: Pay for Pods, Not Nodes
  • Stop Paying for Kubernetes Load Balancers: Use Cloudflare Tunnel Instead
  • 1Password CLI: How NOT to Store Plaintext AWS Credentials or .env on Localhost
  • Solidity: call() vs delegatecall()
  • Solidity: Multicall - Aggregate Multiple Contract Calls
  • Demystifying Solidity ABI Encoding
  • Build Docker Images using Google Cloud Build from Your GitHub Repository
  • Solidity: Read Contract Storage by Slots with Foundry
  • Surviving the Digital Dark Forest: Tips for Staying Safe Online
  • Solidity: CREATE vs CREATE2
  • Solidity: calldata, memory, and storage
  • hardhat-deploy: Upgradeable Contracts with Linked Libraries
  • Deploy Ethereum RPC Provider Load Balancer with HAProxy in Kubernetes (AWS EKS)
  • Deploy graph-node (The Graph) in Kubernetes (AWS EKS)
  • Amazon EKS: Setup Cluster Autoscaler

Categories

  • Big Data (13)
  • Blockchain (11)
  • Database (15)
  • DevOps (36)
  • JavaScript (8)
  • Machine Learning (17)
  • Misc (6)
  • Python (48)
  • Web Development (40)

Tags

amazon web services apache spark aws cli aws cloudfront aws dynamodb aws eks aws lambda aws s3 cache celery chat bot ci cli tools cloudflare codetengu concurrency database migrations datetime debug django django models docker editor elasticsearch env ethereum ethers.js evm feature engineering foundry functional programming github go google bigquery google cloud platform graphql haproxy http io java javascript kubernetes linux linux sysadmin machine learning maven mongodb mysql networking nginx node.js profiling python recommender system redis scala search security smart-contract solidity subgraph testing tls transaction typescript visual studio code web crawler wordpress

Search

© 2025 I Failed the Turing Test. Stork Theme by JustGoodThemes.