Online Feature Release Notes

From Q
Jump to: navigation, search

The changes to the online features within Q and Displayr prior to August 2019 are listed below.

For online features released after July 2019 and all changes of the Q application please see here.

For details of new online features released since July 2019, see here.

July 2019

June 2019

May 2019

  • In Displayr, visualizations now allow row and columns to be selected using List Box or Combo Box controls.
  • Preliminary Project Setup QScripts can now be run in Displayr to produce summary plots and tables, identify flatlining, check for errors in data files, and more.
  • Utilities for "None of these" alternatives have been fixed when running the scripts in Choice Modeling - Save Variable(s) - Utilities.

April 2019

  • Data labels in stacked Column Chart are now by default vertically centered.
  • Several QScripts for creating variables that previously only worked in Q are now available in Displayr under Insert > Utilties > Create New Variables including Midpoint Coding and Quantification, Create New Top 2 Category Variables, and Case-Level Shares.
  • Updated to R 3.6.0. This includes a change to the default method for generating from a discrete uniform distribution to fix an existing bug. This may cause small change in R outputs using using imputed data, resampled (weighted) data, and many of the Machine Learning, Choice Modeling and MaxDiff outputs. To reproduce old results add 'RNGversion("3.5.3")' to the beginning of the R code.
  • Visualizations now allows the Legend to be set to Show, to force the legend to always be shown even for a single data series.
  • Added Text Analysis - Advanced - Map.
  • A template can created (in Displayr, Insert > Utilities > Visualization > Create Template; in Q, Automate > Browse Online Library > Visualization > Create Template) to control the fonts and colors used in all visualizations. The template also allows Brand colors to be associated with row/column names in the input data.

March 2019

  • Choice Modeling - Hierarchical Bayes now supports choice experiments with a dual-response 'none' question (i.e. questionnaires where the 'None' option is asked as a second stage question separate from the main choice task).
  • The mice package used for imputation has been updated to version 3.4.0. Some values may change in R outputs where missing data has been imputed.
  • The aspect ratio in radar plots are now fixed at 1 to avoid values along a particular axis looking disproportionately large.
  • Visualization - Scatterplot now has controls to adjust marker size.
  • Visualization - Time Series Graph now has controls to adjust range on the y-axis.
  • Most of the Visualizations (Bar, Column, Area, Line, Radar, Time Series) allow the legend to be orientated vertically (default) or horizontally.
  • Updated output of Text Analysis - Advanced - Setup Text Analysis to show both the transformed text and word frequency simultaneously. Tokens in the frequency table and transformed text are color coded for easier reading.

February 2019

  • Updated Visualization - Pictograph - Single Icon, Conditional Image, and Visualization - Number so that values can be entered as percentages instead of decimals (e.g "50%" instead of 0.5) into textboxes. This affects a number of controls including Input data, Thresholds to control the fill color and/or image, and Scale and Maximum. The Number type can also be set to "Percentage (no sign)" to hide the percentage sign in the output widget.
  • Updated Missing Data - Plot by Case. Label formatting has been improved and now has hovertext and data labels to make it easier to identify missing cases.
  • Version variables from response data are now accepted by Choice Modeling - Hierarchical Bayes and Choice Modeling - Latent Class Analysis, as an alternative to task variables.
  • Added Choice Modeling - Multinomial Logit and Marketing - MaxDiff - Multinomial Logit, which are equivalent to single-class latent class analysis.
  • Visualizations using Regression outputs as the Data Source now shows the importance scores (if available) or coefficients of the regression model.
  • Handling of variables as the data source in Visualizations has been improved. A grouping variable can be applied to multiple variables, and categorical variables can be treated as binary variables (previously they were always converted to numerical variables for aggregation).
  • For Visualization - Heatmap, convert to percentages/proportions computes the Total percentage instead of the Row percentage.

January 2019

  • Added Sparklines and Conditional Image to Visualization.
  • A non-backwards compatible change has been made to how percentage data is handled in Visualization - Pictograph - Single Icon, Conditional Image, and Visualization - Number. Percentages from table inputs are now stored as decimals. This means that old widgets with Format > DATA LABEL > Number type set to "Number" (which was previously the default) will have changed outputs, i.e 45% will be shown as '0.45' (or '0' with rounding). Set Format > DATA LABEL > Number type to "Automatic" or "Percentages" to show "45%" instead. (The Standard R page for these items has now been modified so the Number type is by default set to "Automatic", and percentages will automatically have the "%" suffix added).

December 2018

November 2018

October 2018

  • MaxDiff experimental designs can now be created with labeled alternatives (instead of numbers).
  • Stacked Bar and Columns charts can now be used with both positive and negative values
  • Updated Visualization - Pictograph - Single Icon and Visualization - Pictograph - Repeated Icon to use the same controls as Visualization - Number. This means that pictographs also allows conditional coloring, and can take input data from tables and variables.
  • New visualizations to display a single number in Visualization - Number (Displayr only) as a Number, Oval, Rectangle, Donut or Gauge. Controls also allow conditional coloring of the shape/icon.
  • Added option for Area, Bar, Column, Line, Radar and Pyramid charts to have data labels shown in different colors for each series.
  • Added option for data labels in Labeled Scatter plots to be automatically shown in the color of data series.
  • Added option to data labels in Stacked Area, Stacked Bar, Stacked Column and Pyramid charts to be automatically shown in black or white depending on the series color.
  • Visualization can now be applied to most of the non-tabular R outputs allowing customization. R outputs supported include Dimension Reduction (data is most appropriately shown as scatterplots), Regression and Machine Leaning (heatmaps), Choice and MaxDiff (distribution charts), Correlation matrix (heatmap), Confusion matrix (heatmap).
  • Added checkbox option to toggle the fitting of alternative-specific constants (coefficients) in Choice Modeling - Hierarchical Bayes and Choice Modeling - Latent Class Analysis.
  • Added ability to randomize order of alternatives in Marketing - MaxDiff - Experimental Design.
  • Version and Question columns added to Marketing - MaxDiff - Experimental Design, for consistency with multi-version designs.

September 2018

  • Updated color palettes in Visualizations. Primary colors has been removed, and Colorblind safe colors, Spectral colors have been added. Heat colors has also been reversed so larger values are shown in red.
  • Geolocation of IP addresses. Data - Geocode IPs outputs country and continent from IPv4 or IPv6 addresses.
  • Marketing - MaxDiff - Compare Models calculates a table of summary metrics allowing easy comparison of different MaxDiff models.
  • Marketing - MaxDiff - Ensemble creates an ensemble by averaging the respondent parameters of multiple MaxDiff models. Models must all use the same underlying data.
  • Compare Models and Ensemble for Choice Modeling. Analogous to MaxDiff as above.
  • Zip codes (or postcodes) added to Visualization - Geographic Map for Australia, UK and USA. Note the following:
    • Large maps (USA especially) may be slow to display. Turning off Show missing values improves the speed but may require Background map to provide context.
    • Zip code country is Automatic by default and will attempt to determine if the inputs are zip codes and from which country. Changing this option is helpful to resolve ambiguity.
    • US zip codes do not cover the whole of the land area and are not precisely defined.
    • US zip codes are 5-digit integers, Australian postcodes are 4-digit integers.
    • UK postcodes use only the district prefix e.g. BD14 (one or two letters followed by one or two digits).

August 2018

  • Upgrade to R version 3.5.1 (from 3.4.4). This may cause some differences in t-SNE and Multidimensional Scaling.
  • All Regression and Machine Learning features now share a common standard interface. The Algorithm control allows easy switching between models.
  • Added control to choose which dimensions to plot for Correspondence Analysis of a Table.
  • Discriminant functions output for Linear Discriminant Analysis has been changed from an output and is now run from Machine Learning - Diagnostic - Discriminant Functions. This allows discriminant functions to be shown without recalculating the analysis.
  • Amended Linear Discriminant Analysis to convert unordered categorical predictors to binary dummy variables. This is consistent with the approach of other predictive methods such as Regression, Deep Learning and Support Vector Machines. To revert to the previous approach of converting categories to numbers (e.g. red, green, blue -> 1, 2, 3) convert the predictor variable to numeric before using LDA. The approach of converting to numbers is still taken for ordered factors.
  • Added new Visualization chart type Pyramid
  • Amended Sankey Diagram to allow specifying the node width, node and link colors and label font type and font size.

July 2018

June 2018

May 2018

  • The mice package used for imputation has been updated to version 3.0.0. Some values may change in R outputs where missing data has been imputed.
  • MaxDiff analysis now automatically handles generic labels where the best and worst selections are indices amongst the alternatives show (instead of the usual labels of best and worst selections).
  • New controls added to Visualization for row and column manipulations of input data before charting.
  • New Visualization options to show multiple series as small multiples. Available for chart types 'Area', 'Bar', 'Column', 'Line', 'Geographic Map', and 'Radar'.
  • Added controls to Visualization to adjust values axis range in chart types 'Area', 'Bar', 'Column', 'Line', 'Geographic Map', and 'Radar'.
  • Added controls to Visualization to control fonts used in Geographic Map (with plotly package only).

April 2018

  • Log-likelihood and BIC statistics added to the outputs of MaxDiff and Choice Modeling.
  • MaxDiff Experimental Design now uses a 'pairwise balanced' method to create multiple-version designs where the frequencies of each pair of alternatives occurring together are balanced. The Detailed outputs includes additional diagnostics of balances between and across versions.
  • Deep Learning has been added to the machine learning and classifier menus. The model performs classification or regression using a neural network built with keras and tensorflow. Cross-validation is performed to find the optimal number of training epochs.
  • Added Choice Modeling - Experimental Design. 3 methods of data entry (individual attributes, number of levels per attribute and spreadsheet). Initially 4 algorithms are available, more to follow. Those algorithms are:
    • Random - randomly chooses each level of each attribute. Ensures the same alternative is not duplicated within a question.
    • Shortcut - ensures frequencies of each level of each attribute are balanced.
    • Complete enumeration - ensures frequencies of each level of each attribute are balanced and balance between pairs of levels across different attributes.
    • Balanced overlap - as per Complete enumeration except allows more overlap whereby the same level may be repeated within a question.
  • Added Custom Data Files - Survey Gizmo MaxDiff which makes it possible to analyse Survey Gizmo MaxDiff data in Q using Latent Class Analysis or Hierarchical Bayes.

March 2018

  • Visualization now has a checkbox so that users can create a crosstab when when multiple variables from 'Data' are selected as the data source. This provides an alternative to selecting a variable in the 'Groups' dropdown which is not always shown on the form.
  • Scatterplot can now handle multiple y-values. The user has a choice whether to treat variables/columns as colors and sizes or as additional y-values.See this document for examples of the various types of scatterplot.
  • MaxDiff improvements to footer clarifying that incomplete cases are removed from analysis.
  • Warning message for Visualizations with more than 10 variables, suggesting that creating a table first may be more appropriate.
  • The randomForest package used by Random Forest has been updated with various bug fixes. Notably "The tie-breaking mechanisms in various places were not implemented correctly. Maximum tree size was computed incorrectly. Splits on categorical variables with >=32 levels were not encoded correctly." The observed impact is that these changes are beneficial to prediction accuracy, although some changes may be expected.
  • The random number generation process of the Rcpp package used by t-SNE was corrected, which resulted in changes to t-SNE plots. Note that a random seed parameter seed can be set in the R code, resulting in different outputs.
  • Speed improvements for hierarchical Bayes analysis for MaxDiff and Choice Modeling

February 2018

  • New visualizations:
    • Distributions - Histogram, Box plot, Bean plot, Violin plot, Density plot
    • Venn diagram
    • Time series graph
    • Streamgraph