Seach the website
Login
Register
Dark Mode
Brightness
Ambient Glow – Questions list
Register
Profile
Edit Profile
Messages
My favorites
My Updates
Logout
Courses
Feedback
Filter
No answer
No selected answer
No upvoted answer
Recent questions without a selected answer
1
1 vote
1
1 answer
1.6k
1.6k views
How to analyse imbalanced categorical colum in dataset
Hello,I have a dataset with a categorical column that contains three categories. One of the categories represents 98% of the data, while the remaining 2% are distributed ...
Hagar
130
points
1
1
3
asked
Jun 24, 2023
Data Science
data-science
imbalanced-data
data-analysis
data-cleaning
+
–
0
0 votes
0
0 answers
1.3k
1.3k views
When to use one hot encode a category and when to segment by category?
When pre processing data for machine learning. Is there any difference in using one hot encoding to turn categoric variables into numeric variables or to segment the data...
NewDS64
120
points
1
1
3
asked
Feb 22, 2023
Machine Learning
machine-learning
preprocessing
+
–
0
0 votes
1
1 answer
7.6k
7.6k views
How to calculate the residual errors, (MSE),(MAE), and (RMSE)?
Given the following sample dataset with 5 samples and 2 features:SampleFeature 1Feature 2Actual ValuePredicted Value1234623456345674567856789Calculate the residual errors...
tofighi
116k
points
73
79
101
asked
Jan 26, 2023
Machine Learning
residual
linear-regression
ml-midterm
ml-exercise
machine-learning
ele888-midterm
+
–
0
0 votes
0
0 answers
1.3k
1.3k views
Can you verify the validity of this chart comparing the review scores for Marvel Phase 4?
I have some skepticism about the validity of the charts below comparing the critic and audience reviews for Phase 4 of the MCU to the previous 3 phases. There are over 18...
GammaRay247
120
points
1
1
3
asked
Jan 9, 2023
Exploratory Data Analysis
data-science
+
–
0
0 votes
0
0 answers
1.4k
1.4k views
Which code has best runtime and why?(the one commented or the other one)
# for key, value in dict.items(): # if value >= long: # long = value # long_name = key # if value < sh...
someone_prog
120
points
1
1
3
asked
Sep 2, 2022
Python
python
+
–
0
0 votes
0
0 answers
1.5k
1.5k views
Creating tables from unstructured texts about stock market
I am trying to extract information such as profits, revenues and others along with their corresponding dates and quarters from an unstructured text about stock market and...
messyaryal
120
points
1
1
3
asked
Aug 1, 2022
Machine Learning
nlp
machine-learning
data-science
information-extraction
ai
+
–
0
0 votes
0
0 answers
1.3k
1.3k views
How do I compare the count of a value in each year while having a different sanple size each year.
How do I accurately compare between the number of something a survey measure from my employees each year with a varying umber of survey engagement and employee size?If I ...
Nescafeadjust
120
points
1
1
3
asked
Jun 8, 2022
general
data-science
+
–
0
0 votes
0
0 answers
1.4k
1.4k views
Is it possible to make a forecast of a future value of Air Temperature using Fast Fourier Transform?
Is it possible to make a forecast of a future value of Air Temperature using Fast Fourier Transform, if yes, what should be the process or how you'll be able to do it. Th...
nuroxyjames
120
points
1
1
3
asked
Jun 2, 2022
Data Science
fft
data-science
fast-fourier-transform
spectral
analysis
prediction
forecasting
+
–
0
0 votes
0
0 answers
1.3k
1.3k views
forecast log transformed fitted values for 2 years using ARMA model
Input is a stock price in exponential transformation. We are asked to forecast using ARMA results for 2 years.
mbassoun
120
points
1
1
3
asked
May 4, 2022
Exploratory Data Analysis
data-science
predict
+
–
0
0 votes
0
0 answers
1.5k
1.5k views
Kmeans clustering in python - Giving original labels to predicted clusters
I have a dataset with 7 labels in the target variable.X = data.drop('target', axis=1) Y = data['target'] Y.unique()array(['Normal_Weight', 'Overweight_Level_I', 'Overweig...
Frenzy
120
points
2
2
4
asked
Apr 27, 2022
Machine Learning
python
machine-learning
clustering
scikit-learn
+
–
0
0 votes
0
0 answers
1.0k
1.0k views
Bankruptcy prediction and credit card
Hello everyone newbie data scientist here.I'm working on a project to predict companies (probability of default) bankruptcy probability and to assign them a credit rating...
Yassine
120
points
1
1
3
asked
Apr 10, 2022
Machine Learning
classification
deep-learning
data-science
linear-regression
+
–
0
0 votes
1
1 answer
1.8k
1.8k views
how to output f1-score instead of accuracy
I have the code below, outputting the accuracy. How can I output the F1-score instead? Thanks in advance,clf.fit(data_train,target_train) preds = clf.predict(data_test) #...
psil
120
points
1
1
3
asked
Apr 2, 2022
Python
machine-learning
f1score
accuracy
+
–
0
0 votes
0
0 answers
991
991 views
I cannot get this code to work. please help.
from keras.models import Sequential from keras.layers import Dense from keras.layers import LSTM from sklearn.model_selection import train_test_splitmodel = Sequential() ...
anonymous1200
120
points
1
1
3
asked
Mar 21, 2022
Python
machine-learning
data-science
neural-network
+
–
0
0 votes
0
0 answers
992
992 views
Battery data projects
Where can I find projects related to battery data?
alanq
130
points
1
1
4
asked
Mar 2, 2022
General
data-science
+
–
1
1 vote
0
0 answers
1.1k
1.1k views
How can you build dynamic pricing model with data only from rigid pricing?
I want to build a dynamic pricing model which means if product is too expansive for a client and there is a risk that we might loose a client we lower the price for them ...
Gwanza
130
points
1
1
3
asked
Jan 21, 2022
General
dynamic-pricing
data-science
predict
+
–
0
0 votes
0
0 answers
809
809 views
What analytical software would be good for a company to use?
This would be for a company that is just now looking into using a software to track data for wine making.
AverageJane
120
points
1
1
3
asked
Jan 14, 2022
Data Science
data-science-programming-software-
+
–
0
0 votes
0
0 answers
989
989 views
Do you usually collect you own data or there is always a resource available for you? Or it depends on the company?
matew
120
points
1
1
3
asked
Jan 9, 2022
Data Science Interview Questions
data-science
+
–
1
1 vote
1
1 answer
1.4k
1.4k views
When dealing with categorical values, should the 'year' column be encoded using OHE or OrdinalEncoder?
It's a car prices dataset, and so I'm assuming that the more recent the more value a car should have. The values in the 'year' column simply consist of years from 1995 to...
Anas
150
points
2
2
4
asked
Dec 18, 2021
Machine Learning
machine-learning
data-science
predict
onehotencoder
data-analysis
ordinal-encoder
+
–
0
0 votes
0
0 answers
549
549 views
How do I know which encoder to use to convert from categorical variables to numerical?
So say I have a column with categorical data like different styles of temperature: 'Lukewarm', 'Hot', 'Scalding', 'Cold', 'Frostbite',... etc.I know that we can use pd.ge...
Anas
150
points
2
2
4
asked
Nov 28, 2021
Exploratory Data Analysis
machine-learning
test-set
data
predict
onehotencoder
pandas
dataframe
data-science
data-analysis
data-cleaning
+
–
0
0 votes
0
0 answers
1.9k
1.9k views
ValueError: Length mismatch: Expected axis has 60 elements, new values have 2935849 elements
I'm creating a new data frame with the most used items grouped together. But I got the following error when grouping through ID and items. ValueError: Length mismatch: Ex...
aryansan
120
points
1
1
3
asked
Nov 26, 2021
Exploratory Data Analysis
data
python
+
–
Page:
1
2
3
4
...
18
next »