lab3 practice - Preuves
Captures et plans d’execution generes lors du lab.
Plans et logs
plan_iterative.txt
== Parsed Logical Plan ==
'Project [unresolvedstarwithcolumns(prediction, UDF('features) AS prediction#27184, None)]
+- Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, UDF(raw_features#4917) AS features#4970]
+- Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, UDF(struct(win_rate, win_rate#4888, pick_rate, pick_rate#4889, kda_ratio, kda_ratio#4890)) AS raw_features#4917]
+- LogicalRDD [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890], false
== Analyzed Logical Plan ==
hero_id: bigint, hero_name: string, win_rate: double, pick_rate: double, kda_ratio: double, raw_features: vector, features: vector, prediction: int
Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, features#4970, UDF(features#4970) AS prediction#27185]
+- Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, UDF(raw_features#4917) AS features#4970]
+- Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, UDF(struct(win_rate, win_rate#4888, pick_rate, pick_rate#4889, kda_ratio, kda_ratio#4890)) AS raw_features#4917]
+- LogicalRDD [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890], false
== Optimized Logical Plan ==
Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, features#4970, UDF(features#4970) AS prediction#27185]
+- InMemoryRelation [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, features#4970], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, UDF(raw_features#4917) AS features#4970]
+- *(1) Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, UDF(struct(win_rate, win_rate#4888, pick_rate, pick_rate#4889, kda_ratio, kda_ratio#4890)) AS raw_features#4917]
+- *(1) Scan ExistingRDD[hero_id#4886L,hero_name#4887,win_rate#4888,pick_rate#4889,kda_ratio#4890]
== Physical Plan ==
*(1) Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, features#4970, UDF(features#4970) AS prediction#27185]
+- InMemoryTableScan [features#4970, hero_id#4886L, hero_name#4887, kda_ratio#4890, pick_rate#4889, raw_features#4917, win_rate#4888]
+- InMemoryRelation [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, features#4970], StorageLevel(disk, memory, deserialized, 1 replicas)
+- *(1) Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, raw_features#4917, UDF(raw_features#4917) AS features#4970]
+- *(1) Project [hero_id#4886L, hero_name#4887, win_rate#4888, pick_rate#4889, kda_ratio#4890, UDF(struct(win_rate, win_rate#4888, pick_rate, pick_rate#4889, kda_ratio, kda_ratio#4890)) AS raw_features#4917]
+- *(1) Scan ExistingRDD[hero_id#4886L,hero_name#4887,win_rate#4888,pick_rate#4889,kda_ratio#4890]
summary.txt
Lab 3 - Resume - 2026-05-03 15:26:14
Meilleure config sweep : KMeans k=6 silhouette=0.4635
Stabilite (KMeans k=6) : moyenne=0.4691, ecart-type=0.0080
Partition la plus rapide : repartition=2 clusters_reels=6 (966.1 ms)
Total de runs enregistres : 18