Feature selection in SVM

Machine learning

variable importance in SVMthe importance of predictor variables for individual classes in svmFeature slection Optimal feature selection for support vector machines Minh Hoai Nguyen , Fernando de la Torre

2018-12-05

R関数や引数の名前の意味

R

Rの関数や関数の引数では、多くの省略形で名前がついていいるが、何をabbreviate しているか以下に書いてあった。便利 http://jeromyanglim.blogspot.com/2010/05/abbreviations-of-r-commands-explained.htmlたとえば cex はcharacter extension の略 mgp …

2018-12-05

Radmacher 平均

Machine learning

Radmacher 平均Radmacher平均のテクニックを用いて、期待値と標本平均の差の期待値の上限を求める機械学習で有用http://ttic.uchicago.edu/~tewari/lectures/lecture9.pdf

2018-12-04

R plot 内に数式を書く

R

https://stats.biopapyrus.jp/r/graph/expression.html

2018-12-03

SVM and related methods

Machine learning

Support Vector Machine Support Vector Clustering Support Vector Regression https://cs.adelaide.edu.au/~chhshen/teaching/ML_SVR.pdf Transductive support vector machines

2018-11-28

Ｒで2標本問題

R Data Science

Ｒで2標本問題竹澤氏 http://cse.naro.affrc.go.jp/takezawa/r-tips/r/65.html松岡氏の記事 2標本問題の新展開~古典的手法からカーネル法まで~ https://qiita.com/yuchi_m/items/7132b426d848dc81ad9f

2018-11-17

統計的学習理論：リソース

Machine learning

統計的学習理論鈴木氏資料 http://ibisml.org/archive/ibis2012/ibis2012-suzuki.pdfVapnick氏論文 http://internet.math.arizona.edu/~hzhang/math574m/Read/vapnik.pdfSridharan氏の講義ノート（主に理論） http://www.cs.cornell.edu/courses/cs6783/2018…

2018-11-14

Causal Inference : Categorization

Causal inference

Causal Inference : CategorizationTreatments : 2値、多値、順序変数、連続型 Assignments: RCT, Regular assignments Estimand: ATE, ATT Dose-response function Response variables

2018-11-14

Causal Inference for Continuous Treatments

Causal inference R

Causal inference for continuous treatmentsEstimation of a continuous dose-response function 〇Hirano, K. and Imbens, G. W. (2004). The propensity score with continuous treatments. In Gelman, A. and Meng, X.-L., editors, Applied Bayesian Mo…

2018-11-11

Multi-valued treatment のPropensity Score

Causal inference

Multi-valued treatment のPropensity Scoreについての論文Imbens (2000), BiometrikaLechner, M. (2001): Identification and Estimation of Causal Effects of Multiple Treatments Under The Conditional Independence Assumption, in Econometric Evalua…

2018-11-03

R　glm, lm における欠損値の処理

R Data Science

R 欠損値の処理実際のデータ分析では欠損値が頻繁に表れる。特にglm, lm などでNAがあるobservations がdeleteされるので、その処理に以下のページの説明が非常に役立つ。関数complete.cases()を使う。https://stackoverflow.com/questions/23345419/r-fla…

2018-11-02

カーネル法、カーネルトリックのリソース

Machine learning

カーネル法、カーネルトリックのリソース結構ある @kilometer氏のブログ https://qiita.com/kilometer/items/58376b9a103743329b2f福水氏の資料 http://www.ism.ac.jp/~fukumizu/OsakaU2014/OsakaU_1intro.pdf再生核ヒルベルト空間、その他 Jordan氏の講義ノ…

2018-11-01

R ：データフレームの因子データを数値データに変換する

R

データフレームの因子データを一括で数値データに変換するyad 氏のブログのページ https://dev.classmethod.jp/statistics/r-dataframe-lapply/

2018-10-29

Support Vector Machine

Machine learning

Support Vector MachineSVMのリソースページ http://www.svms.org/「多変量解析入門」小西貞則著, (SVM、凸最適化の簡潔な説明) 「カーネル法入門」福水健次著、朝倉書店概説 https://en.wikipedia.org/wiki/Support_vector_machine (Empirical Risk Minimiz…

2018-10-27

R　列名を指定してデータを抽出

R

R 列名を指定してデータを抽出非常に便利。 character型のベクトルを使うことが重要ひつじ助氏のブログが有用 https://qiita.com/hitsujisuke/items/d71ee40daa0786ae1680character 型への変換はas.character() http://cse.naro.affrc.go.jp/takezawa/r-tip…

2018-10-25

r で括弧を描く：pBrackets

R

pBrackets パッケージで簡単に括弧を描けるhttps://cran.r-project.org/web/packages/pBrackets/pBrackets.pdf例 https://cran.r-project.org/web/packages/pBrackets/vignettes/pBrackets_vignette.pdf

2018-10-21

RでPropensity Score 分析

Causal inference R

RでPropensity Score 分析岩波データサイエンス加藤・星野論文のコードとデータがgithub にある https://github.com/iwanami-datascience/vol3CM接触とゲーム使用時間の因果性分析

2018-10-21

Rでmatching : ATE, ATTの推定

Causal inference R

Rでmatching Matching パッケージMatchIt パッケージ Ho, Imai, King, Stuart による解説 https://r.iq.harvard.edu/docs/matchit/2.4-20/matchit.pdfA Step-by-Step Guide to Propensity Score Matching in R 応用 CMの効果 https://tjo.hatenablog.com/ent…

2018-10-14

Rでロジットモデル推定

Rでロジットモデル推定glm()関数を使う http://cse.naro.affrc.go.jp/takezawa/r-tips/r/72.html

2018-10-14

CART : RでCART

Machine learning R

CART Classification and Regression TreeCART のまとめは一度書いたがもう一度トライ主に、Breiman Friedman, Olshen, Stone (BFOS)と下川・杉本・後藤に基づく。CARTは次の3つのプロセスから構成される (i) Tree を最大まで（基準を満たすまで）成長させ…

2018-10-13

RでCART, Random Forest：リソース

R Machine learning

RでCART rpart パッケージを使う。Recursive Partitioning: Large Companies and Glaucoma Diagnosis https://cran.r-project.org/web/packages/HSAUR/vignettes/Ch_recursive_partitioning.pdf (これが一番詳しい) https://www.r-bloggers.com/using-cart-i…

2018-10-11

Causal inference オリジナル文献

Causal inference

Causal inference 文献 Neyman, JerzyPotential outcomes Neyman, Jerzy. 1923 [1990]. “On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.” Statistical Science 5 (4): 465–472. Trans. Dorota M. …

2018-10-10

R　論理演算子

R

R 論理演算子http://cse.naro.affrc.go.jp/takezawa/r-tips/r/28.html ==, != など

2018-09-29

Random forest : Use of out-of-bag data

Data Science Machine learning

Random forest : Use of out-of-bag data Generalization error を推定するout-of-bag 推定という巧妙な方法がある。Generalization errorは ] で定義される。ただしはトレーニングセットRandom Forest は、Bagging (Bootstrap aggregation)とtreeのsplit …

2018-09-28

Causal Inference by Imbens and Rubin

Causal inference

Causal Inference by Imbens and Rubin Chapter 1 Causality : Basic Framework以下のサイトに資料あり https://www2.stat.duke.edu/courses/Spring14/sta320.01/Chapter 3 Assignment mechanism Def. 1 Individualistic assignment Def2. Probabilistic ass…

2018-09-27

Random forest : Statistical inference

Data Science Machine learning

Random Forest と統計的推論Athey and Wager Wager established asymptotic normality of predictor. Wager, Hastie and Efron (2014) proposed the infinitesimal jackknife to construct a confidence interval with Random Forest. Ｑ：regression tree …

2018-09-27

Boosting

Data Science Machine learning

BoostingBoosting の考え方は非常に面白い。 Base Learner の弱いところを見つけ、強化していく。 Boosting はメタアルゴリズム。（Base Learner がインプット。）複数回強化したBase Learner の重み付平均（重み付投票）が Boosting learner である。CART…

2018-09-22

CART 1

Data Science

CART Classification and Regression Tree Binary tree model Classifier Learning set the set of measurements the set of classes d: classifier (clasification rule) Criterion for which variable to use for splits Gini index をimpurity measure に…

2018-09-21

Bagging

Data Science

Bagging が有効な予測量 unstable な予測 CART regression after variable selection

2018-09-20

R 確率分布に関連する関数

R

R 確率分布に関連する関数 pnorm, dnorm, qnor,m rnorm http://cse.naro.affrc.go.jp/takezawa/r-tips/r/60.html

causal_statのノート

R, Tex, データサイエンスに関するノート

2018-01-01から1年間の記事一覧

Feature selection in SVM

R関数や引数の名前の意味

Radmacher 平均

R plot 内に数式を書く

SVM and related methods

Ｒで2標本問題

統計的学習理論：リソース

Causal Inference : Categorization

Causal Inference for Continuous Treatments

Multi-valued treatment のPropensity Score

R　glm, lm における欠損値の処理

カーネル法、カーネルトリックのリソース

R ：データフレームの因子データを数値データに変換する

Support Vector Machine

R　列名を指定してデータを抽出

r で括弧を描く：pBrackets

RでPropensity Score 分析

Rでmatching : ATE, ATTの推定

Rでロジットモデル推定

CART : RでCART

RでCART, Random Forest：リソース

Causal inference オリジナル文献

R　論理演算子

Random forest : Use of out-of-bag data

Causal Inference by Imbens and Rubin

Random forest : Statistical inference

Boosting

CART 1

Bagging

R 確率分布に関連する関数